単語適合率最大基準に基づく複数システムの統合

Presentation	2002/6/21 Combination of Multiple Recognizer Outputs Based on Maximum Word Acceptance Rate Path Selection Hirofumi YAMAMOTO, Konstantin MARKOV, Kozo OKUDA,
PDF Download Page	PDF download Page Link
Abstract(in Japanese)	(See Japanese page)
Abstract(in English)	In this paper, a new method for combining multiple recognition results is proposed. There are three important issues when combining multiple results. (1) What score is associated with each word. (2) How to combine different word sequences, i.e., how to construct a word graph. (3) How to select the best path through the word graph. In our approach, a word a-posteriori probability is used as the word score. The word graph is constructed by the N-dimensional DP matching of multiple word sequences based on the minimum edit distance. As for the best path, we select the word path that gives the maximum expected word acceptance rate. We evaluated this method in two experiments. In the first experiment, we combined the outputs of three systems having different acoustic features. The proposed method resulted in a 2.2-point lower word error rate. In the second experiment, we combined the outputs of the three systems, which use acoustic models with three different frame shifts, and we achieved a 0.6-point lower word error rate.
Keyword(in Japanese)	(See Japanese page)
Keyword(in English)	ROVER / DP-matching / word acceptance rate
Paper #	SP2002-49
Date of Issue

Paper Information
Registration To	Speech (SP)
Language	JPN
Title (in Japanese)	(See Japanese page)
Sub Title (in Japanese)	(See Japanese page)
Title (in English)	Combination of Multiple Recognizer Outputs Based on Maximum Word Acceptance Rate Path Selection
Sub Title (in English)
Keyword(1)	ROVER
Keyword(2)	DP-matching
Keyword(3)	word acceptance rate
1st Author's Name	Hirofumi YAMAMOTO
1st Author's Affiliation	ATR Spoken Language Translation Research Laboratories()
2nd Author's Name	Konstantin MARKOV
2nd Author's Affiliation	ATR Spoken Language Translation Research Laboratories
3rd Author's Name	Kozo OKUDA
3rd Author's Affiliation	ATR Spoken Language Translation Research Laboratories:(Present address)Sanyo Electric Co., Ltd. Technology R & D Headquarters Digital Systems Development Center Human Interface Department
Date	2002/6/21
Paper #	SP2002-49
Volume (vol)	vol.102
Number (no)	160
Page	pp.pp.-
#Pages	5
Date of Issue