Presentation 2000/12/14
Evaluation of Sound Source Discrimination Based on HMMs Using a Microphone Array
Takanobu NISHIURA, Satoshi NAKAMURA, Kiyohiro SHIKANO,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) It is very important for a hands-free speech interface to capture distant talking speech with high quality. A microphone array is an ideal candidate as an effective method for capturing distant talking speech. However, it is necessary to localize the target talker before capturing distant talking speech using a microphone array. In the conventional method of talker localization, it is difficult to estimate the target talker position accurately among localized sound sources, while the sound sources can be easily localized in a multiple sound source environment. To cope with this problem, we propose a talker localization algorithm by discriminating the sound sources using statistical speech and noise models based on HMMs (Hidden Marcov Models). First, the directions of signal arrival are estimated using a microphone array. Then, the desired sound signals are enhanced by steering the directivities to the estimated directions of signal arrival. Tha talker can be localized after discriminating between "speech" or "noise" from the desired sound signals using statistical speech and noise HMMs. In this paper, we evaluate the discrimination performance for the source position-known condition and position-unknown condition. The system recognizes the input from a sound source which is discriminated as being "speech" using statistical speech and noise HMMs. As a result, we confirm that the talker position is localized accurately because speech and noise can be discriminated efficiently in reverberant environments.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) Microphone array / Sound source discrimination / HMM / Talker localization / Speech recognition / RWCP-DB
Paper # NLC2000-32,SP2000-80
Date of Issue

Conference Information
Committee NLC
Conference Date 2000/12/14(1days)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Natural Language Understanding and Models of Communication (NLC)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Evaluation of Sound Source Discrimination Based on HMMs Using a Microphone Array
Sub Title (in English)
Keyword(1) Microphone array
Keyword(2) Sound source discrimination
Keyword(3) HMM
Keyword(4) Talker localization
Keyword(5) Speech recognition
Keyword(6) RWCP-DB
1st Author's Name Takanobu NISHIURA
1st Author's Affiliation ATR Spoken Language Translation Research Laboratories : Graduate School of Information Science, Nara Institute of Science and Technology()
2nd Author's Name Satoshi NAKAMURA
2nd Author's Affiliation ATR Spoken Language Translation Research Laboratories
3rd Author's Name Kiyohiro SHIKANO
3rd Author's Affiliation Graduate School of Information Science, Nara Institute of Science and Technology
Date 2000/12/14
Paper # NLC2000-32,SP2000-80
Volume (vol) vol.100
Number (no) 520
Page pp.pp.-
#Pages 6
Date of Issue