Presentation | 2000/12/14 Evaluation of Sound Source Discrimination Based on HMMs Using a Microphone Array Takanobu NISHIURA, Satoshi NAKAMURA, Kiyohiro SHIKANO, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | It is very important for a hands-free speech interface to capture distant talking speech with high quality. A microphone array is an ideal candidate as an effective method for capturing distant talking speech. However, it is necessary to localize the target talker before capturing distant talking speech using a microphone array. In the conventional method of talker localization, it is difficult to estimate the target talker position accurately among localized sound sources, while the sound sources can be easily localized in a multiple sound source environment. To cope with this problem, we propose a talker localization algorithm by discriminating the sound sources using statistical speech and noise models based on HMMs (Hidden Marcov Models). First, the directions of signal arrival are estimated using a microphone array. Then, the desired sound signals are enhanced by steering the directivities to the estimated directions of signal arrival. Tha talker can be localized after discriminating between "speech" or "noise" from the desired sound signals using statistical speech and noise HMMs. In this paper, we evaluate the discrimination performance for the source position-known condition and position-unknown condition. The system recognizes the input from a sound source which is discriminated as being "speech" using statistical speech and noise HMMs. As a result, we confirm that the talker position is localized accurately because speech and noise can be discriminated efficiently in reverberant environments. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | Microphone array / Sound source discrimination / HMM / Talker localization / Speech recognition / RWCP-DB |
Paper # | NLC2000-32,SP2000-80 |
Date of Issue |
Conference Information | |
Committee | NLC |
---|---|
Conference Date | 2000/12/14(1days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Natural Language Understanding and Models of Communication (NLC) |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Evaluation of Sound Source Discrimination Based on HMMs Using a Microphone Array |
Sub Title (in English) | |
Keyword(1) | Microphone array |
Keyword(2) | Sound source discrimination |
Keyword(3) | HMM |
Keyword(4) | Talker localization |
Keyword(5) | Speech recognition |
Keyword(6) | RWCP-DB |
1st Author's Name | Takanobu NISHIURA |
1st Author's Affiliation | ATR Spoken Language Translation Research Laboratories : Graduate School of Information Science, Nara Institute of Science and Technology() |
2nd Author's Name | Satoshi NAKAMURA |
2nd Author's Affiliation | ATR Spoken Language Translation Research Laboratories |
3rd Author's Name | Kiyohiro SHIKANO |
3rd Author's Affiliation | Graduate School of Information Science, Nara Institute of Science and Technology |
Date | 2000/12/14 |
Paper # | NLC2000-32,SP2000-80 |
Volume (vol) | vol.100 |
Number (no) | 520 |
Page | pp.pp.- |
#Pages | 6 |
Date of Issue |