Presentation | 2004/12/14 Speech Recognition Adopting Compensated Acoustic Likelihood based on Noise Models. Shoei SATO, Kazuo ONOE, Akio KOBAYASHI, Toru IMAI |
---|---|
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | To improve recognition accuracy for speech uttered in a noisy environment, this paper proposes a new compensation method for acoustic scores in the Viterbi search. In this method, to cope with a wider variety of background noise whose characteristics change rapidly, a confidence factor is obtained as a posterior probability of speech models or as a likelihood ratio between speech models and noise models. This confidence factor represents the reliability of the acoustic score for the input speech. In decoding, the weight of the acoustic score at a noisy frame is reduced according to the value of the confidence factor. An experiment with broadcast news transcription showed that this method reduced word errors for input speech with lower SNR values (0-5 dB). The greatest reduction of word errors, 20%, was obtained at an SNR of 0 dB. This paper also proposes a modification of the compensation, which improved the recognition performance at a higher SNR of 10 dB. The proposed method was also applied to recognition of a noisy sports program. The results showed that the method improved recognition accuracy for keywords, which are important for automatic metadata extraction. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | speech recognition / noisy environment / acoustic score / compensation |
Paper # | NLC2004-58,SP2004-98 |
Date of Issue |
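
The abstract describes reducing the weight of the acoustic score at noisy frames according to a confidence factor. A minimal sketch of that idea follows, assuming a two-class (speech vs. noise) posterior computed from per-frame log-likelihoods and a simple multiplicative weighting of the log score. The function names, the equal prior, and the multiplicative form are illustrative assumptions and are not taken from the paper.

```python
import math


def speech_posterior(log_lik_speech, log_lik_noise, prior_speech=0.5):
    """Confidence factor for one frame (hypothetical helper).

    Assumes only two competing models, speech and noise, so the posterior
    of speech is a logistic function of the log-likelihood ratio.
    """
    log_odds = (log_lik_speech - log_lik_noise) + math.log(
        prior_speech / (1.0 - prior_speech)
    )
    # Numerically stable logistic function.
    if log_odds >= 0:
        return 1.0 / (1.0 + math.exp(-log_odds))
    e = math.exp(log_odds)
    return e / (1.0 + e)


def compensated_scores(acoustic_log_scores, speech_log_liks, noise_log_liks):
    """Scale each frame's acoustic log score by its confidence factor.

    Assumed form: a confidence near 1 keeps the score almost unchanged,
    while a confidence near 0 (noise-dominated frame) shrinks its
    contribution to the path score toward zero.
    """
    weighted = []
    for score, ls, ln in zip(acoustic_log_scores, speech_log_liks, noise_log_liks):
        confidence = speech_posterior(ls, ln)
        weighted.append(confidence * score)
    return weighted
```

In a decoder, the weighted per-frame scores would replace the raw acoustic log scores when accumulating path scores, so that frames dominated by noise influence the search less than clean frames.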
Conference Information | |
Committee | NLC |
---|---|
Conference Date | 2004/12/14 (1 day)
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Natural Language Understanding and Models of Communication (NLC) |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Speech Recognition Adopting Compensated Acoustic Likelihood based on Noise Models. |
Sub Title (in English) | |
Keyword(1) | speech recognition |
Keyword(2) | noisy environment |
Keyword(3) | acoustic score |
Keyword(4) | compensation |
1st Author's Name | Shoei SATO |
1st Author's Affiliation | NHK Science and Technical Research Laboratories
2nd Author's Name | Kazuo ONOE |
2nd Author's Affiliation | NHK Science and Technical Research Laboratories |
3rd Author's Name | Akio KOBAYASHI |
3rd Author's Affiliation | NHK Science and Technical Research Laboratories |
4th Author's Name | Toru IMAI |
4th Author's Affiliation | NHK Science and Technical Research Laboratories |
Date | 2004/12/14 |
Paper # | NLC2004-58,SP2004-98 |
Volume (vol) | vol.104 |
Number (no) | 539 |
Page | pp.-
#Pages | 6 |
Date of Issue |