Language Modeling and Acoustic Modeling for Automatic Transcription of Japanese Broadcast-News Speech

大附 克年; 古井 貞煕; 桜井 直之; 岩崎 淳; 張 志鵬

Presentation	1998/12/11; Language Modeling and Acoustic Modeling for Automatic Transcription of Japanese Broadcast-News Speech; Katsutoshi Ohtsuki, Sadaoki Furui, Naoyuki Sakurai, Atsushi Iwasaki, Zhi-Peng Zhang
Abstract(in English)	This paper reports on language modeling and acoustic modeling studies for broadcast-news speech recognition, carried out as part of our development of a large-vocabulary continuous speech recognition (LVCSR) system for transcribing Japanese broadcast-news speech. We constructed a language model that depends on the readings of words, whereas conventional language models depend on written words. Because each speaker in broadcast news utters several sentences in succession, we applied on-line speaker adaptation after identifying the speaker of each sentence. The reading-dependent language model reduced the word error rate by about 10%, and the on-line speaker adaptation reduced it by about 15%. We also propose a new formulation for speech recognition that maximizes the a posteriori probability of the speaker's intended message given the observed acoustic sequence. Applying this formulation to rescoring N-best hypotheses yielded better results.
Keyword(in English)	LVCSR / broadcast-news speech / n-gram / on-line speaker adaptation / message-driven speech recognition
Paper #	NLC98-44,SP98-108
Date of Issue

Conference Information
Committee	NLC
Conference Date	1998/12/11 (1 day)

Paper Information
Registration To	Natural Language Understanding and Models of Communication (NLC)
Language	JPN
Title (in English)	Language Modeling and Acoustic Modeling for Automatic Transcription of Japanese Broadcast-News Speech
Sub Title (in English)
Keyword(1)	LVCSR
Keyword(2)	broadcast-news speech
Keyword(3)	n-gram
Keyword(4)	on-line speaker adaptation
Keyword(5)	message-driven speech recognition
1st Author's Name	Katsutoshi Ohtsuki
1st Author's Affiliation	NTT Human Interface Laboratories
2nd Author's Name	Sadaoki Furui
2nd Author's Affiliation	Tokyo Institute of Technology, Department of Computer Science
3rd Author's Name	Naoyuki Sakurai
3rd Author's Affiliation	Tokyo Institute of Technology, Department of Computer Science
4th Author's Name	Atsushi Iwasaki
4th Author's Affiliation	Tokyo Institute of Technology, Department of Computer Science
5th Author's Name	Zhi-Peng Zhang
5th Author's Affiliation	Tokyo Institute of Technology, Department of Computer Science
Date	1998/12/11
Paper #	NLC98-44,SP98-108
Volume (vol)	vol.98
Number (no)	461
Page	pp.-
#Pages	7