Presentation 2002/12/12
Language Model Switching Based on Topic Detection for Dialog Speech Recognition
Ian R. LANE, Tatsuya KAWAHARA, Tomoko MATSUI, Satoshi NAKAMURA,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) An efficient, scalable speech recognition architecture is proposed for multi-domain dialog systems by combining topic detection and topic-dependent language modeling. The inferred domain is automatically detected from the user's utterance, and speech recognition is then pefformed with an appropriate domain-dependent language model. The architecture improves accuracy and efficiency over current approaches and is scaleable to a large number of domains. In this paper, unigram likelihood and SVM based topic detection methods are compared. A novel framework using a multi-layer hierarchy of language models is also introduced in order to improve robustness against topic detection errors. The proposed system provides a relative reduction in WER of 10.3% over a single language model system. Furthermore, it achieves an accuracy that is comparable to using multiple language models in parallel while requiring only a fraction of the computational cost.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) Speech Recognition / Dialog Speech / Topic Detection / Support Vector Machines / Multi-domain Dialog Systems
Paper # SP2002-145
Date of Issue

Conference Information
Committee SP
Conference Date 2002/12/12(1days)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Speech (SP)
Language ENG
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Language Model Switching Based on Topic Detection for Dialog Speech Recognition
Sub Title (in English)
Keyword(1) Speech Recognition
Keyword(2) Dialog Speech
Keyword(3) Topic Detection
Keyword(4) Support Vector Machines
Keyword(5) Multi-domain Dialog Systems
1st Author's Name Ian R. LANE
1st Author's Affiliation School of Informatics, Kyoto University:ATR Spoken Language Translation Laboratories()
2nd Author's Name Tatsuya KAWAHARA
2nd Author's Affiliation School of Informatics, Kyoto University:ATR Spoken Language Translation Laboratories
3rd Author's Name Tomoko MATSUI
3rd Author's Affiliation ATR Spoken Language Translation Laboratories
4th Author's Name Satoshi NAKAMURA
4th Author's Affiliation ATR Spoken Language Translation Laboratories
Date 2002/12/12
Paper # SP2002-145
Volume (vol) vol.102
Number (no) 529
Page pp.pp.-
#Pages 6
Date of Issue