Presentation 2007/12/13
Dynamic feature variance adaptation for robust speech recognition with a speech enhancement pre-processor
Marc DELCROIX, Tomohiro NAKATANI, Shinji WATANABE,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) It is well known that the performance of automatic speech recognition degrades severely in presence of noise or reverberation. Speech enhancement techniques may reduce such acoustic perturbations, but often do not interconnect well with speech recognizer. To cope with this problem, model adaptation is usually used to reduce the mismatch between the speech enhanced features and the acoustic model used by the recognizer. However, conventional model adaptation techniques assume static mismatch and may therefore not cope well with dynamic mismatch arising from noise or reverberation. There seems to be a lack of optimal ways to combine model adaptation and speech enhancement. In this paper we propose a novel adaptation scheme that may cope with dynamic mismatch. We introduce a parametric model for variance adaptation that includes static components, and dynamic components derived from a speech enhancement pre-process. The model parameters are optimized using adaptive training. An evaluation of the method with a speech dereverberation for pre-processing revealed that a 80% relative error rate reduction was possible compared with the recognition of dereverberated speech, and the final error rate was 5.4% which is close to that of clean speech (1.2%).
Keyword(in Japanese) (See Japanese page)
Keyword(in English) Robust ASR / Variance compensation / Model adaptation
Paper # NLC2007-42,SP2007-105
Date of Issue

Conference Information
Committee NLC
Conference Date 2007/12/13(1days)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Natural Language Understanding and Models of Communication (NLC)
Language ENG
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Dynamic feature variance adaptation for robust speech recognition with a speech enhancement pre-processor
Sub Title (in English)
Keyword(1) Robust ASR
Keyword(2) Variance compensation
Keyword(3) Model adaptation
1st Author's Name Marc DELCROIX
1st Author's Affiliation NTT Communication Science Laboratories, NTT Corporation()
2nd Author's Name Tomohiro NAKATANI
2nd Author's Affiliation NTT Communication Science Laboratories, NTT Corporation
3rd Author's Name Shinji WATANABE
3rd Author's Affiliation NTT Communication Science Laboratories, NTT Corporation
Date 2007/12/13
Paper # NLC2007-42,SP2007-105
Volume (vol) vol.107
Number (no) 405
Page pp.pp.-
#Pages 6
Date of Issue