Presentation 2002/12/13
Mis-recognized Utterance Detection Using Multiple Language Models Generated by Clustered Sentences
Katsuhisa FUJINAGA, Hiroaki KOKUBO, Hirofumi YAMAMOTO, Genichiro KIKUI, Hiroshi SHIMODAIRA
Abstract(in Japanese) (See Japanese page)
Abstract(in English) In this paper, we propose a new method for detecting mis-recognized utterances based on a voting scheme similar to ROVER. ROVER has two serious problems: 1) it is difficult to construct multiple speech recognition systems (SRSs), and 2) the computational cost increases with the number of SRSs. In contrast to conventional ROVER, the proposed method uses multiple language models (LMs), namely a general LM and sub LMs generated from clustered sentences, instead of different SRSs. Speech recognition with the sub LMs is performed by rescoring rather than by parallel decoding. In experiments, the proposed method achieved 18-point higher precision than the baseline at a 10% loss of recall, and 22-point higher precision at a 20% loss of recall.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) ROVER / sentence clustering / confidence measure / mis-recognized utterance detection
Paper # NLC2002-72
Date of Issue

Conference Information
Committee NLC
Conference Date 2002/12/13 (1 day)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Natural Language Understanding and Models of Communication (NLC)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Mis-recognized Utterance Detection Using Multiple Language Models Generated by Clustered Sentences
Sub Title (in English)
Keyword(1) ROVER
Keyword(2) sentence clustering
Keyword(3) confidence measure
Keyword(4) mis-recognized utterance detection
1st Author's Name Katsuhisa FUJINAGA
1st Author's Affiliation ATR Spoken Language Translation Research Laboratories / School of Information Science, Japan Advanced Institute of Science and Technology
2nd Author's Name Hiroaki KOKUBO
2nd Author's Affiliation ATR Spoken Language Translation Research Laboratories
3rd Author's Name Hirofumi YAMAMOTO
3rd Author's Affiliation ATR Spoken Language Translation Research Laboratories
4th Author's Name Genichiro KIKUI
4th Author's Affiliation ATR Spoken Language Translation Research Laboratories
5th Author's Name Hiroshi SHIMODAIRA
5th Author's Affiliation School of Information Science, Japan Advanced Institute of Science and Technology
Date 2002/12/13
Paper # NLC2002-72
Volume (vol) vol.102
Number (no) 528
Page pp.-
#Pages 6