Acoustic Models for Spontaneous Speech Recognition Using Morphological Information and Within-Word Position Information

Kenzo ISOGAWA; Koichi SHINODA; Shigeki SAGAYAMA

Presentation	2002/12/12 Acoustic modeling using word contexts for spontaneous speech recognition Kenzo ISOGAWA, Koichi SHINODA, Shigeki SAGAYAMA
Abstract(in Japanese)	(See Japanese page)
Abstract(in English)	In this paper, we study state clustering using word contexts for speech recognition. In spontaneous speech, poorly articulated words often cause recognition errors. To improve recognition performance, we add two kinds of questions to phonetic decision-tree-based state clustering. One is a question about parts of speech, and the other is a question about the position of phones within a word. To apply the question about parts of speech, we classify parts of speech into two classes based on word duration estimated from a corpus of spontaneous speech. After training HMMs for each class, we carry out state clustering using a context decision tree with questions about the classes. To apply questions about the position of phones within a word, we train separate HMMs for phones at the beginning of a word, phones at the end of a word, and phones at other positions. We then carry out state clustering using a context decision tree with questions about phone positions. We carried out speech recognition experiments using CSJ (Corpus of Spontaneous Japanese). In the best case, word accuracy improved by 2.4 points with the former method and by 6.1 points with the latter method.
Keyword(in Japanese)	(See Japanese page)
Keyword(in English)	spontaneous speech / acoustic model / part of speech / decision tree
Paper #	SP2002-139
Date of Issue
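
The word-position questions described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the `B`/`I`/`E` labels, and the choice to treat a single-phone word as word-initial are all assumptions made for illustration. The sketch only shows how phones could be tagged by position and how a yes/no position question would partition them before decision-tree clustering.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical position labels: B = word-initial, E = word-final, I = other.
TaggedPhone = Tuple[str, str]


def tag_phone_positions(phones: List[str]) -> List[TaggedPhone]:
    """Attach a within-word position label to each phone of one word.

    Assumption: a word with a single phone is labeled word-initial ("B").
    The abstract does not say how such words are handled.
    """
    last = len(phones) - 1
    tagged = []
    for i, phone in enumerate(phones):
        if i == 0:
            label = "B"
        elif i == last:
            label = "E"
        else:
            label = "I"
        tagged.append((phone, label))
    return tagged


# Hypothetical yes/no questions that a decision tree could ask about position.
POSITION_QUESTIONS: Dict[str, Callable[[str], bool]] = {
    "is_word_initial": lambda label: label == "B",
    "is_word_final": lambda label: label == "E",
}


def split_by_question(
    items: List[TaggedPhone], question: Callable[[str], bool]
) -> Tuple[List[TaggedPhone], List[TaggedPhone]]:
    """Partition tagged phones into the 'yes' and 'no' branches of a question."""
    yes = [item for item in items if question(item[1])]
    no = [item for item in items if not question(item[1])]
    return yes, no
```

For example, `tag_phone_positions(["k", "a", "i"])` labels `k` as word-initial, `a` as other, and `i` as word-final. In an actual clustering procedure, each candidate split would be scored (for instance by likelihood gain) before being accepted; that scoring is omitted here.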

Conference Information
Committee	SP
Conference Date	2002/12/12 (1 day)
Place (in Japanese)	(See Japanese page)
Place (in English)
Topics (in Japanese)	(See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To	Speech (SP)
Language	JPN
Title (in Japanese)	(See Japanese page)
Sub Title (in Japanese)	(See Japanese page)
Title (in English)	Acoustic modeling using word contexts for spontaneous speech recognition
Sub Title (in English)
Keyword(1)	spontaneous speech
Keyword(2)	acoustic model
Keyword(3)	part of speech
Keyword(4)	decision tree
1st Author's Name	Kenzo ISOGAWA
1st Author's Affiliation	Graduate School of Information Science and Technology, The University of Tokyo
2nd Author's Name	Koichi SHINODA
2nd Author's Affiliation	Graduate School of Information Science and Technology, The University of Tokyo
3rd Author's Name	Shigeki SAGAYAMA
3rd Author's Affiliation	Graduate School of Information Science and Technology, The University of Tokyo
Date	2002/12/12
Paper #	SP2002-139
Volume (vol)	vol.102
Number (no)	529
Page	pp.-
#Pages	6
Date of Issue