Presentation | 2002/12/12 Acoustic modeling using word contexts for spontaneous speech recognition Kenzo ISOGAWA, Koichi SHINODA, Shigeki SAGAYAMA |
---|---|
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | In this paper, we study state clustering using word contexts for speech recognition. In spontaneous speech, poorly articulated words often cause recognition errors. To improve recognition performance, we add two kinds of questions to phonetic decision-tree-based state clustering. One is a question about parts of speech, and the other is a question about the position of phones within a word. To apply the question about parts of speech, we classify parts of speech into two classes based on word duration estimated from a corpus of spontaneous speech. After building HMMs for each class, we carry out state clustering using a context decision tree with questions about the classes. To apply questions about the position of phones within a word, we build separate HMMs for phones at the beginning of a word, phones at the end of a word, and phones at other positions. We then carry out state clustering using a context decision tree with questions about phone positions. We carried out speech recognition experiments using CSJ (Corpus of Spontaneous Japanese). In the best case, word accuracy improved by 2.4 points with the former method and by 6.1 points with the latter method. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | spontaneous speech / acoustic model / part of speech / decision tree |
Paper # | SP2002-139 |
Date of Issue |
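
The two kinds of word context named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the `B`/`M`/`E` tags, the handling of single-phone words, the use of the mean as the duration estimate, and the `threshold` parameter are all assumptions made for illustration, since the abstract does not specify them.

```python
from collections import defaultdict
from statistics import mean


def tag_phone_positions(phones):
    """Label each phone of one word as 'B' (first), 'E' (last), or 'M' (other).

    Assumption: a single-phone word is labeled 'B'; the abstract does not
    say how such words are treated.
    """
    last = len(phones) - 1
    tagged = []
    for i, phone in enumerate(phones):
        if i == 0:
            tag = "B"
        elif i == last:
            tag = "E"
        else:
            tag = "M"
        tagged.append((phone, tag))
    return tagged


def split_pos_by_duration(samples, threshold):
    """Assign each part of speech to a 'short' or 'long' class.

    samples:   iterable of (part_of_speech, word_duration_in_seconds) pairs.
    threshold: hypothetical cut-off on the mean duration (an assumption).
    """
    durations = defaultdict(list)
    for pos, duration in samples:
        durations[pos].append(duration)
    return {
        pos: ("short" if mean(values) < threshold else "long")
        for pos, values in durations.items()
    }


if __name__ == "__main__":
    # Toy inputs, invented for illustration only.
    print(tag_phone_positions(["k", "a", "i"]))
    print(split_pos_by_duration(
        [("particle", 0.08), ("particle", 0.12), ("noun", 0.30), ("noun", 0.40)],
        threshold=0.2,
    ))
```

Labels of this kind could then serve as the answers to the additional decision-tree questions, alongside the usual phonetic-context questions, when states are clustered.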
Conference Information | |
Committee | SP |
---|---|
Conference Date | 2002/12/12 (1 day) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Speech (SP) |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Acoustic modeling using word contexts for spontaneous speech recognition |
Sub Title (in English) | |
Keyword(1) | spontaneous speech |
Keyword(2) | acoustic model |
Keyword(3) | part of speech |
Keyword(4) | decision tree |
1st Author's Name | Kenzo ISOGAWA |
1st Author's Affiliation | Graduate School of Information Science and Technology, The University of Tokyo |
2nd Author's Name | Koichi SHINODA |
2nd Author's Affiliation | Graduate School of Information Science and Technology, The University of Tokyo |
3rd Author's Name | Shigeki SAGAYAMA |
3rd Author's Affiliation | Graduate School of Information Science and Technology, The University of Tokyo |
Date | 2002/12/12 |
Paper # | SP2002-139 |
Volume (vol) | vol.102 |
Number (no) | 529 |
Page | pp.- |
#Pages | 6 |
Date of Issue |