Presentation 2004/12/14
Two-stage Noise Spectra Estimation and Regression based In-car Speech Recognition using Single Distant Microphone
Weifeng LI, Katunobu ITOU, Kazuya TAKEDA, Fumitada ITAKURA,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) In this paper, we present a two-stage noise spectra estimation approach. After the first-stage noise estimation using the improved minima controlled recursive averaging (IMCRA) method, the second-stage noise estimation is performed by employing a maximum a posteriori (MAP) noise amplitude estimator. We also develop a regression-based speech enhance system by approximating the clean speech with the estimated noise and original noisy speech. Evaluation experiments show that the proposed two-stage noise estimation method results in lower estimation error for all test noise types. Compared to original noisy speech, the proposed regression-based approach obtains an average relative word error rate (WER) reduction of 65% in our isolated word recognition experiments conducted in 12 real car environments.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) maximum a posteriori (MAP) estimation / spectral subtraction / speech enhancement / multi-layer perceptron / speech recognition
Paper # NLC2004-77,SP2004-117
Date of Issue

Conference Information
Committee NLC
Conference Date 2004/12/14(1days)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Natural Language Understanding and Models of Communication (NLC)
Language ENG
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Two-stage Noise Spectra Estimation and Regression based In-car Speech Recognition using Single Distant Microphone
Sub Title (in English)
Keyword(1) maximum a posteriori (MAP) estimation
Keyword(2) spectral subtraction
Keyword(3) speech enhancement
Keyword(4) multi-layer perceptron
Keyword(5) speech recognition
1st Author's Name Weifeng LI
1st Author's Affiliation Graduate School of Engineering, Nagoya University()
2nd Author's Name Katunobu ITOU
2nd Author's Affiliation Graduate School of Information Science, Nagoya University
3rd Author's Name Kazuya TAKEDA
3rd Author's Affiliation Graduate School of Information Science, Nagoya University
4th Author's Name Fumitada ITAKURA
4th Author's Affiliation Faculty of Science and Technology, Meijo University, Meijo University
Date 2004/12/14
Paper # NLC2004-77,SP2004-117
Volume (vol) vol.104
Number (no) 539
Page pp.pp.-
#Pages 6
Date of Issue