Presentation | 2001/12/13 Syllable recognition using syllable-segmental statistics and syllable-based HMM Nobutoshi TAKAHASHI, Seiichi NAKAGAWA, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | In our previous research, we demonstrated the validity of segmental unit input hidden Markov model (HMM), which regards successive four frame MEL-cepstrum coefficients as a feature vector. The vector is compressed into 20 dimensions using the KL transform. However, the model considers only the correlation between frames in a short section, but not the correlation between the frames over a long section. In this paper, in order to represent the correlation over a long distance, we use the syllable-segmental statistics that are calculated by the concatenation of feature vectors, corresponding to each state in a syllable based HMM. As this concatenated feature vector consists of a high dimension, the dimension is reduced using the K-L transform. The statistics are modeled by a GMM. The use of syllable-segment statistics allows the model to express the correlation between the frames over a long distance (e.g., the correlation between a vector in the first state and a vector in the fourth state in a syllable-based HMM). For modeling and estimating, we conducted a forced Viterbi alignment against continuous speech using a conventional HMM, and then we segmented continuous speech into syllable segments. By combining this approach with a segmental-unit input HMM, the syllable recognition rate was improved to 87.7% from 83.7% for syllables taken from continuous speech, without using a language model. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | syllable recognition / segment model / HMM |
Paper # | NLC2001-51,SP2001-86 |
Date of Issue |
Conference Information | |
Committee | NLC |
---|---|
Conference Date | 2001/12/13(1days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Natural Language Understanding and Models of Communication (NLC) |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Syllable recognition using syllable-segmental statistics and syllable-based HMM |
Sub Title (in English) | |
Keyword(1) | syllable recognition |
Keyword(2) | segment model |
Keyword(3) | HMM |
1st Author's Name | Nobutoshi TAKAHASHI |
1st Author's Affiliation | Information and Computer Sciences, Toyohashi University of Technology() |
2nd Author's Name | Seiichi NAKAGAWA |
2nd Author's Affiliation | Information and Computer Sciences, Toyohashi University of Technology |
Date | 2001/12/13 |
Paper # | NLC2001-51,SP2001-86 |
Volume (vol) | vol.101 |
Number (no) | 520 |
Page | pp.pp.- |
#Pages | 6 |
Date of Issue |