動的分散適応に基づく音声強調と音声認識の統合手法の提案(音声認識・識別,第9回音声言語シンポジウム)

デルクロア マーク; 中谷 智広; 渡部 晋治

Presentation	2007/12/13 Dynamic feature variance adaptation for robust speech recognition with a speech enhancement pre-processor Marc DELCROIX, Tomohiro NAKATANI, Shinji WATANABE,
PDF Download Page	PDF download Page Link
Abstract(in Japanese)	(See Japanese page)
Abstract(in English)	It is well known that the performance of automatic speech recognition degrades severely in presence of noise or reverberation. Speech enhancement techniques may reduce such acoustic perturbations, but often do not interconnect well with speech recognizer. To cope with this problem, model adaptation is usually used to reduce the mismatch between the speech enhanced features and the acoustic model used by the recognizer. However, conventional model adaptation techniques assume static mismatch and may therefore not cope well with dynamic mismatch arising from noise or reverberation. There seems to be a lack of optimal ways to combine model adaptation and speech enhancement. In this paper we propose a novel adaptation scheme that may cope with dynamic mismatch. We introduce a parametric model for variance adaptation that includes static components, and dynamic components derived from a speech enhancement pre-process. The model parameters are optimized using adaptive training. An evaluation of the method with a speech dereverberation for pre-processing revealed that a 80% relative error rate reduction was possible compared with the recognition of dereverberated speech, and the final error rate was 5.4% which is close to that of clean speech (1.2%).
Keyword(in Japanese)	(See Japanese page)
Keyword(in English)	Robust ASR / Variance compensation / Model adaptation
Paper #	NLC2007-42,SP2007-105
Date of Issue

Conference Information
Committee	NLC
Conference Date	2007/12/13(1days)
Place (in Japanese)	(See Japanese page)
Place (in English)
Topics (in Japanese)	(See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To	Natural Language Understanding and Models of Communication (NLC)
Language	ENG
Title (in Japanese)	(See Japanese page)
Sub Title (in Japanese)	(See Japanese page)
Title (in English)	Dynamic feature variance adaptation for robust speech recognition with a speech enhancement pre-processor
Sub Title (in English)
Keyword(1)	Robust ASR
Keyword(2)	Variance compensation
Keyword(3)	Model adaptation
1st Author's Name	Marc DELCROIX
1st Author's Affiliation	NTT Communication Science Laboratories, NTT Corporation()
2nd Author's Name	Tomohiro NAKATANI
2nd Author's Affiliation	NTT Communication Science Laboratories, NTT Corporation
3rd Author's Name	Shinji WATANABE
3rd Author's Affiliation	NTT Communication Science Laboratories, NTT Corporation
Date	2007/12/13
Paper #	NLC2007-42,SP2007-105
Volume (vol)	vol.107
Number (no)	405
Page	pp.pp.-
#Pages	6
Date of Issue