Presentation | 2002/12/12 Normalizing the Acoustic Qualities of Monophones in an Utterance Muhammad GHULAM, Takashi FUKUDA, Tsuneo NITTA, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | In this paper, we expand our previously proposed HMM-SM-based speech recognition system to a connected digit recognition task by exploring the effect of normalizing the acoustic qualities of the monophones in an utterance and compare it with a number of HMM-based systems with utterance-level normalization, word-level normalization, monophone-level normalization and without normalization. In the proposed HMM-SM-based system, an HMM-based classifier classifies the N-best hypotheses (word candidates), and then an SM (Subspace Method)-based verifier tests the hypotheses after applying the monophone score normalization. Experimental results performed on a connected digit recognition task showed that the word correct rate and the word accuracy rate were significantly improved by the proposed method from 96.3% to 98.7% and from 95.7% to 98.2%, respectively, compared with the convenient HMM-based classifier with utterance-level normalization. The proposed method also showed high performance over the other HMM-based systems that we have compared. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | Speech Recognition / HMM / Normalization of Acoustic Quality / Subspace Method |
Paper # | NLC2002-45 |
Date of Issue |
Conference Information | |
Committee | NLC |
---|---|
Conference Date | 2002/12/12(1days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Natural Language Understanding and Models of Communication (NLC) |
---|---|
Language | ENG |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Normalizing the Acoustic Qualities of Monophones in an Utterance |
Sub Title (in English) | |
Keyword(1) | Speech Recognition |
Keyword(2) | HMM |
Keyword(3) | Normalization of Acoustic Quality |
Keyword(4) | Subspace Method |
1st Author's Name | Muhammad GHULAM |
1st Author's Affiliation | Graduate School of Engineering, Toyohashi University of Technology() |
2nd Author's Name | Takashi FUKUDA |
2nd Author's Affiliation | Graduate School of Engineering, Toyohashi University of Technology |
3rd Author's Name | Tsuneo NITTA |
3rd Author's Affiliation | Graduate School of Engineering, Toyohashi University of Technology |
Date | 2002/12/12 |
Paper # | NLC2002-45 |
Volume (vol) | vol.102 |
Number (no) | 527 |
Page | pp.pp.- |
#Pages | 6 |
Date of Issue |