Presentation | 2004/12/14 Multi-Channel Estimation of the Power Spectral Density of Noise for Mixtures of Non-Stationary Signals Wolfgang HERBORDT, Satoshi NAKAMURA, Walter KELLERMANN, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | The proposed paper deals with the estimation of the power spectral density (PSD) of noise for mixtures of non-stationary wideband signals exploiting sparseness in the time-frequency domain. The proposed method is applied to realize a beam former-plus-post filter structure for noise-robust speech recognition. Experiments with a small-scale 4-sensor microphones array show that interfering speech and music can be suppressed by 13 dB for an average signal-to-noise ratio of 0 dB without impairing the quality of the desired speech signal and without introducing 'musical noise'. This noise suppression is reflected by the improvement of the word accuracy of a connected-digit speech recognizer. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | Noise estimation / microphone arrays / beam forming / speech recognition |
Paper # | NLC2004-76,SP2004-116 |
Date of Issue |
Conference Information | |
Committee | SP |
---|---|
Conference Date | 2004/12/14(1days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Speech (SP) |
---|---|
Language | ENG |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Multi-Channel Estimation of the Power Spectral Density of Noise for Mixtures of Non-Stationary Signals |
Sub Title (in English) | |
Keyword(1) | Noise estimation |
Keyword(2) | microphone arrays |
Keyword(3) | beam forming |
Keyword(4) | speech recognition |
1st Author's Name | Wolfgang HERBORDT |
1st Author's Affiliation | ATR Spoken Language Translation Research Laboratories() |
2nd Author's Name | Satoshi NAKAMURA |
2nd Author's Affiliation | ATR Spoken Language Translation Research Laboratories |
3rd Author's Name | Walter KELLERMANN |
3rd Author's Affiliation | Telecommunications Laboratory, University Erlangen-Nuremberg |
Date | 2004/12/14 |
Paper # | NLC2004-76,SP2004-116 |
Volume (vol) | vol.104 |
Number (no) | 542 |
Page | pp.pp.- |
#Pages | 6 |
Date of Issue |