Presentation | 2002/12/13 Mis-recognized Utterance Detection Using Multiple Language Models Generated by Clustered Sentences Katsuhisa FUJINAGA, Hiroaki KOKUBO, Hirofumi YAMAMOTO, Genichiro KIKUI, Hiroshi SHIMODAIRA, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | In this paper, we propose a new method that detects mis-recognized utterances, based on voting scheme like ROVER. ROVER has two serious problems, 1) it is difficult to construct multiple speech recognition systems (SRSs), 2) calculation cost increases according to the number of SRSs. In contrast to the conventional ROVER, the proposed method uses multiple language models (LMs), general LM and sub LMs generated by clustered sentence, instead of different SRSs. Speech recognition with sub LMs is proceeded by rescoring, instead of parallel decoding. Through experiments, the proposed method resulted in 18-point higher precision with 10% loss of recall from baseline, and 22-point higher precision with 20% loss of recall. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | ROVER / sentence clustering / confidence measure / mis-recognized utterance detection |
Paper # | NLC2002-72 |
Date of Issue |
Conference Information | |
Committee | NLC |
---|---|
Conference Date | 2002/12/13(1days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Natural Language Understanding and Models of Communication (NLC) |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Mis-recognized Utterance Detection Using Multiple Language Models Generated by Clustered Sentences |
Sub Title (in English) | |
Keyword(1) | ROVER |
Keyword(2) | sentence clustering |
Keyword(3) | confidence measure |
Keyword(4) | mis-recognized utterance detection |
1st Author's Name | Katsuhisa FUJINAGA |
1st Author's Affiliation | ATR Spoken Language Translation Research Laboratories:School of Information Science, Japan Advanced Institute of Science and Technology() |
2nd Author's Name | Hiroaki KOKUBO |
2nd Author's Affiliation | ATR Spoken Language Translation Research Laboratories |
3rd Author's Name | Hirofumi YAMAMOTO |
3rd Author's Affiliation | ATR Spoken Language Translation Research Laboratories |
4th Author's Name | Genichiro KIKUI |
4th Author's Affiliation | ATR Spoken Language Translation Research Laboratories |
5th Author's Name | Hiroshi SHIMODAIRA |
5th Author's Affiliation | School of Information Science, Japan Advanced Institute of Science and Technology |
Date | 2002/12/13 |
Paper # | NLC2002-72 |
Volume (vol) | vol.102 |
Number (no) | 528 |
Page | pp.pp.- |
#Pages | 6 |
Date of Issue |