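Speech Recognition Adopting Compensated Acoustic Likelihood based on Noise Models (Speech Processing in Noise) (6th Spoken Language Symposium)

Shoei SATO; Kazuo ONOE; Akio KOBAYASHI; Toru IMAI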

Presentation	2004/12/14 Speech Recognition Adopting Compensated Acoustic Likelihood based on Noise Models. Shoei SATO, Kazuo ONOE, Akio KOBAYASHI, Toru IMAI
Abstract(in Japanese)	(See Japanese page)
Abstract(in English)	To improve recognition accuracy for speech uttered in a noisy environment, this paper proposes a new compensation method for acoustic scores in the Viterbi search. To cope with a wider variety of background noise whose characteristics change rapidly, the method obtains a confidence factor as either a posterior probability of the speech models or a likelihood ratio between the speech models and the noise models. This confidence factor represents the reliability of the acoustic score for the input speech. In decoding, the weight of the acoustic score at a noisy frame is reduced according to the value of the confidence factor. An experiment on broadcast news transcription showed that the method reduced word errors for input speech with lower SNR values (0-5 dB). The greatest reduction in word errors, 20%, was obtained at an SNR of 0 dB. The paper also proposes a modification of the compensation that improved recognition performance at a higher SNR of 10 dB. The proposed method was also applied to recognition of a noisy sports program; the results showed that it improved the recognition accuracy of keywords, which are important for automatic metadata extraction.
Keyword(in Japanese)	(See Japanese page)
Keyword(in English)	speech recognition / noisy environment / acoustic score / compensation
Paper #	NLC2004-58,SP2004-98
Date of Issue
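
The compensation described in the abstract can be illustrated with a minimal sketch. The abstract does not give the paper's actual formulas, so everything here is an assumption made for illustration: a single speech model and a single noise model per frame, a confidence factor computed as the speech posterior from the log-likelihood ratio, and a compensated score formed by multiplying the speech log-likelihood by that confidence. The function names `speech_posterior` and `compensate_scores` and the `prior_speech` parameter are hypothetical.

```python
import math


def speech_posterior(log_lik_speech, log_lik_noise, prior_speech=0.5):
    """Confidence factor for one frame: posterior probability of the speech model.

    Assumes a two-model setup (speech vs. noise); prior_speech is a
    hypothetical parameter, not taken from the paper.
    """
    # Log-odds = log-likelihood ratio plus the log prior odds.
    log_odds = (log_lik_speech - log_lik_noise) + math.log(
        prior_speech / (1.0 - prior_speech)
    )
    # Numerically stable logistic function.
    if log_odds >= 0:
        return 1.0 / (1.0 + math.exp(-log_odds))
    e = math.exp(log_odds)
    return e / (1.0 + e)


def compensate_scores(speech_scores, noise_scores, prior_speech=0.5):
    """Scale each frame's acoustic log-likelihood by its confidence factor.

    Frames that the noise model explains about as well as the speech model
    get a confidence near 0.5 or lower, so their acoustic score carries
    less weight in the search.
    """
    return [
        speech_posterior(s, n, prior_speech) * s
        for s, n in zip(speech_scores, noise_scores)
    ]


if __name__ == "__main__":
    # Frame 1: speech model clearly better. Frame 2: models tie (noisy frame).
    print(compensate_scores([-10.0, -10.0], [-30.0, -10.0]))
```

In this sketch a frame where both models score equally keeps only half of its acoustic log-likelihood, while a frame dominated by the speech model is left almost unchanged. A decoder would apply the same scaling inside the Viterbi recursion rather than as a separate pass.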

Conference Information
Committee	NLC
Conference Date	2004/12/14 (1 day)
Place (in Japanese)	(See Japanese page)
Place (in English)
Topics (in Japanese)	(See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To	Natural Language Understanding and Models of Communication (NLC)
Language	JPN
Title (in Japanese)	(See Japanese page)
Sub Title (in Japanese)	(See Japanese page)
Title (in English)	Speech Recognition Adopting Compensated Acoustic Likelihood based on Noise Models.
Sub Title (in English)
Keyword(1)	speech recognition
Keyword(2)	noisy environment
Keyword(3)	acoustic score
Keyword(4)	compensation
1st Author's Name	Shoei SATO
1st Author's Affiliation	NHK Science and Technical Research Laboratories
2nd Author's Name	Kazuo ONOE
2nd Author's Affiliation	NHK Science and Technical Research Laboratories
3rd Author's Name	Akio KOBAYASHI
3rd Author's Affiliation	NHK Science and Technical Research Laboratories
4th Author's Name	Toru IMAI
4th Author's Affiliation	NHK Science and Technical Research Laboratories
Date	2004/12/14
Paper #	NLC2004-58,SP2004-98
Volume (vol)	vol.104
Number (no)	539
Page	pp.-
#Pages	6
Date of Issue