Presentation | 2002/12/12 Language Model Switching Based on Topic Detection for Dialog Speech Recognition Ian R. LANE, Tatsuya KAWAHARA, Tomoko MATSUI, Satoshi NAKAMURA, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | An efficient, scalable speech recognition architecture is proposed for multi-domain dialog systems by combining topic detection and topic-dependent language modeling. The inferred domain is automatically detected from the user's utterance, and speech recognition is then pefformed with an appropriate domain-dependent language model. The architecture improves accuracy and efficiency over current approaches and is scaleable to a large number of domains. In this paper, unigram likelihood and SVM based topic detection methods are compared. A novel framework using a multi-layer hierarchy of language models is also introduced in order to improve robustness against topic detection errors. The proposed system provides a relative reduction in WER of 10.3% over a single language model system. Furthermore, it achieves an accuracy that is comparable to using multiple language models in parallel while requiring only a fraction of the computational cost. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | Speech Recognition / Dialog Speech / Topic Detection / Support Vector Machines / Multi-domain Dialog Systems |
Paper # | SP2002-145 |
Date of Issue |
Conference Information | |
Committee | SP |
---|---|
Conference Date | 2002/12/12(1days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Speech (SP) |
---|---|
Language | ENG |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Language Model Switching Based on Topic Detection for Dialog Speech Recognition |
Sub Title (in English) | |
Keyword(1) | Speech Recognition |
Keyword(2) | Dialog Speech |
Keyword(3) | Topic Detection |
Keyword(4) | Support Vector Machines |
Keyword(5) | Multi-domain Dialog Systems |
1st Author's Name | Ian R. LANE |
1st Author's Affiliation | School of Informatics, Kyoto University:ATR Spoken Language Translation Laboratories() |
2nd Author's Name | Tatsuya KAWAHARA |
2nd Author's Affiliation | School of Informatics, Kyoto University:ATR Spoken Language Translation Laboratories |
3rd Author's Name | Tomoko MATSUI |
3rd Author's Affiliation | ATR Spoken Language Translation Laboratories |
4th Author's Name | Satoshi NAKAMURA |
4th Author's Affiliation | ATR Spoken Language Translation Laboratories |
Date | 2002/12/12 |
Paper # | SP2002-145 |
Volume (vol) | vol.102 |
Number (no) | 529 |
Page | pp.pp.- |
#Pages | 6 |
Date of Issue |