Presentation | 2004/12/14 Two-stage Noise Spectra Estimation and Regression based In-car Speech Recognition using Single Distant Microphone Weifeng LI, Katunobu ITOU, Kazuya TAKEDA, Fumitada ITAKURA, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | In this paper, we present a two-stage noise spectra estimation approach. After the first-stage noise estimation using the improved minima controlled recursive averaging (IMCRA) method, the second-stage noise estimation is performed by employing a maximum a posteriori (MAP) noise amplitude estimator. We also develop a regression-based speech enhance system by approximating the clean speech with the estimated noise and original noisy speech. Evaluation experiments show that the proposed two-stage noise estimation method results in lower estimation error for all test noise types. Compared to original noisy speech, the proposed regression-based approach obtains an average relative word error rate (WER) reduction of 65% in our isolated word recognition experiments conducted in 12 real car environments. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | maximum a posteriori (MAP) estimation / spectral subtraction / speech enhancement / multi-layer perceptron / speech recognition |
Paper # | NLC2004-77,SP2004-117 |
Date of Issue |
Conference Information | |
Committee | NLC |
---|---|
Conference Date | 2004/12/14(1days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Natural Language Understanding and Models of Communication (NLC) |
---|---|
Language | ENG |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Two-stage Noise Spectra Estimation and Regression based In-car Speech Recognition using Single Distant Microphone |
Sub Title (in English) | |
Keyword(1) | maximum a posteriori (MAP) estimation |
Keyword(2) | spectral subtraction |
Keyword(3) | speech enhancement |
Keyword(4) | multi-layer perceptron |
Keyword(5) | speech recognition |
1st Author's Name | Weifeng LI |
1st Author's Affiliation | Graduate School of Engineering, Nagoya University() |
2nd Author's Name | Katunobu ITOU |
2nd Author's Affiliation | Graduate School of Information Science, Nagoya University |
3rd Author's Name | Kazuya TAKEDA |
3rd Author's Affiliation | Graduate School of Information Science, Nagoya University |
4th Author's Name | Fumitada ITAKURA |
4th Author's Affiliation | Faculty of Science and Technology, Meijo University, Meijo University |
Date | 2004/12/14 |
Paper # | NLC2004-77,SP2004-117 |
Volume (vol) | vol.104 |
Number (no) | 539 |
Page | pp.pp.- |
#Pages | 6 |
Date of Issue |