話題同定に基づく言語モデル切替えによる対話音声認識

Presentation	2002/12/12 Language Model Switching Based on Topic Detection for Dialog Speech Recognition Ian R. LANE, Tatsuya KAWAHARA, Tomoko MATSUI, Satoshi NAKAMURA,
PDF Download Page	PDF download Page Link
Abstract(in Japanese)	(See Japanese page)
Abstract(in English)	An efficient, scalable speech recognition architecture is proposed for multi-domain dialog systems by combining topic detection and topic-dependent language modeling. The inferred domain is automatically detected from the user's utterance, and speech recognition is then pefformed with an appropriate domain-dependent language model. The architecture improves accuracy and efficiency over current approaches and is scaleable to a large number of domains. In this paper, unigram likelihood and SVM based topic detection methods are compared. A novel framework using a multi-layer hierarchy of language models is also introduced in order to improve robustness against topic detection errors. The proposed system provides a relative reduction in WER of 10.3% over a single language model system. Furthermore, it achieves an accuracy that is comparable to using multiple language models in parallel while requiring only a fraction of the computational cost.
Keyword(in Japanese)	(See Japanese page)
Keyword(in English)	Speech Recognition / Dialog Speech / Topic Detection / Support Vector Machines / Multi-domain Dialog Systems
Paper #	SP2002-145
Date of Issue

Paper Information
Registration To	Speech (SP)
Language	ENG
Title (in Japanese)	(See Japanese page)
Sub Title (in Japanese)	(See Japanese page)
Title (in English)	Language Model Switching Based on Topic Detection for Dialog Speech Recognition
Sub Title (in English)
Keyword(1)	Speech Recognition
Keyword(2)	Dialog Speech
Keyword(3)	Topic Detection
Keyword(4)	Support Vector Machines
Keyword(5)	Multi-domain Dialog Systems
1st Author's Name	Ian R. LANE
1st Author's Affiliation	School of Informatics, Kyoto University:ATR Spoken Language Translation Laboratories()
2nd Author's Name	Tatsuya KAWAHARA
2nd Author's Affiliation	School of Informatics, Kyoto University:ATR Spoken Language Translation Laboratories
3rd Author's Name	Tomoko MATSUI
3rd Author's Affiliation	ATR Spoken Language Translation Laboratories
4th Author's Name	Satoshi NAKAMURA
4th Author's Affiliation	ATR Spoken Language Translation Laboratories
Date	2002/12/12
Paper #	SP2002-145
Volume (vol)	vol.102
Number (no)	529
Page	pp.pp.-
#Pages	6
Date of Issue