Mis-recognized Utterance Detection Using Multiple Language Models Generated by Clustered Sentences

Presentation	2002/12/13 Mis-recognized Utterance Detection Using Multiple Language Models Generated by Clustered Sentences Katsuhisa FUJINAGA, Hiroaki KOKUBO, Hirofumi YAMAMOTO, Genichiro KIKUI, Hiroshi SHIMODAIRA
Abstract(in English)	In this paper, we propose a new method for detecting mis-recognized utterances, based on a voting scheme similar to ROVER. ROVER has two serious problems: 1) it is difficult to construct multiple speech recognition systems (SRSs), and 2) the computational cost increases with the number of SRSs. In contrast to conventional ROVER, the proposed method uses multiple language models (LMs), namely a general LM and sub LMs generated from clustered sentences, instead of different SRSs. Speech recognition with the sub LMs is performed by rescoring rather than by parallel decoding. In experiments, the proposed method achieved 18-point higher precision than the baseline at a 10% loss of recall, and 22-point higher precision at a 20% loss of recall.
Keyword(in English)	ROVER / sentence clustering / confidence measure / mis-recognized utterance detection
Paper #	NLC2002-72
Date of Issue
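
The voting idea in the abstract can be illustrated with a minimal sketch. The abstract does not specify the voting rule, so everything here is an assumption made for illustration: hypotheses are compared at the sentence level (ROVER itself votes on aligned words), the decision is a simple agreement threshold, and the names `agreement_ratio`, `is_misrecognized`, and `threshold` are hypothetical. It is not the authors' implementation.

```python
def agreement_ratio(general_hyp, sub_hyps):
    """Fraction of sub-LM hypotheses identical to the general-LM hypothesis.

    general_hyp: list of word tokens produced with the general LM.
    sub_hyps:    list of token lists, one per sub LM (e.g. obtained by
                 rescoring, as the abstract describes).
    Assumption: exact sentence-level match counts as one vote.
    """
    if not sub_hyps:
        # With no sub-LM results there is nothing to disagree with.
        return 1.0
    votes = sum(1 for hyp in sub_hyps if hyp == general_hyp)
    return votes / len(sub_hyps)


def is_misrecognized(general_hyp, sub_hyps, threshold=0.5):
    """Flag the utterance when agreement falls below a hypothetical threshold."""
    return agreement_ratio(general_hyp, sub_hyps) < threshold


if __name__ == "__main__":
    general = ["i", "would", "like", "a", "room"]
    subs = [
        ["i", "would", "like", "a", "room"],
        ["i", "would", "like", "a", "broom"],
        ["i", "would", "like", "the", "room"],
        ["i", "would", "like", "a", "room"],
    ]
    print(agreement_ratio(general, subs))
    print(is_misrecognized(general, subs, threshold=0.6))
```

Raising the threshold in such a scheme would flag more utterances, which relates to the precision and recall trade-off reported in the abstract, although the actual operating points there come from the authors' own method.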

Paper Information
Registration To	Natural Language Understanding and Models of Communication (NLC)
Language	JPN
Title (in English)	Mis-recognized Utterance Detection Using Multiple Language Models Generated by Clustered Sentences
Sub Title (in English)
Keyword(1)	ROVER
Keyword(2)	sentence clustering
Keyword(3)	confidence measure
Keyword(4)	mis-recognized utterance detection
1st Author's Name	Katsuhisa FUJINAGA
1st Author's Affiliation	ATR Spoken Language Translation Research Laboratories / School of Information Science, Japan Advanced Institute of Science and Technology
2nd Author's Name	Hiroaki KOKUBO
2nd Author's Affiliation	ATR Spoken Language Translation Research Laboratories
3rd Author's Name	Hirofumi YAMAMOTO
3rd Author's Affiliation	ATR Spoken Language Translation Research Laboratories
4th Author's Name	Genichiro KIKUI
4th Author's Affiliation	ATR Spoken Language Translation Research Laboratories
5th Author's Name	Hiroshi SHIMODAIRA
5th Author's Affiliation	School of Information Science, Japan Advanced Institute of Science and Technology
Date	2002/12/13
Paper #	NLC2002-72
Volume (vol)	vol.102
Number (no)	528
Page	pp.-
#Pages	6