Presentation | 2007/12/13 Dynamic feature variance adaptation for robust speech recognition with a speech enhancement pre-processor Marc DELCROIX, Tomohiro NAKATANI, Shinji WATANABE, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | It is well known that the performance of automatic speech recognition degrades severely in presence of noise or reverberation. Speech enhancement techniques may reduce such acoustic perturbations, but often do not interconnect well with speech recognizer. To cope with this problem, model adaptation is usually used to reduce the mismatch between the speech enhanced features and the acoustic model used by the recognizer. However, conventional model adaptation techniques assume static mismatch and may therefore not cope well with dynamic mismatch arising from noise or reverberation. There seems to be a lack of optimal ways to combine model adaptation and speech enhancement. In this paper we propose a novel adaptation scheme that may cope with dynamic mismatch. We introduce a parametric model for variance adaptation that includes static components, and dynamic components derived from a speech enhancement pre-process. The model parameters are optimized using adaptive training. An evaluation of the method with a speech dereverberation for pre-processing revealed that a 80% relative error rate reduction was possible compared with the recognition of dereverberated speech, and the final error rate was 5.4% which is close to that of clean speech (1.2%). |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | Robust ASR / Variance compensation / Model adaptation |
Paper # | NLC2007-42,SP2007-105 |
Date of Issue |
Conference Information | |
Committee | NLC |
---|---|
Conference Date | 2007/12/13(1days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Natural Language Understanding and Models of Communication (NLC) |
---|---|
Language | ENG |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Dynamic feature variance adaptation for robust speech recognition with a speech enhancement pre-processor |
Sub Title (in English) | |
Keyword(1) | Robust ASR |
Keyword(2) | Variance compensation |
Keyword(3) | Model adaptation |
1st Author's Name | Marc DELCROIX |
1st Author's Affiliation | NTT Communication Science Laboratories, NTT Corporation() |
2nd Author's Name | Tomohiro NAKATANI |
2nd Author's Affiliation | NTT Communication Science Laboratories, NTT Corporation |
3rd Author's Name | Shinji WATANABE |
3rd Author's Affiliation | NTT Communication Science Laboratories, NTT Corporation |
Date | 2007/12/13 |
Paper # | NLC2007-42,SP2007-105 |
Volume (vol) | vol.107 |
Number (no) | 405 |
Page | pp.pp.- |
#Pages | 6 |
Date of Issue |