マイクロホンアレーを用いたHMMに基づく音源識別の評価

西浦 敬信; 中村 哲; 鹿野 清宏

Presentation	2000/12/14 Evaluation of Sound Source Discrimination Based on HMMs Using a Microphone Array Takanobu NISHIURA, Satoshi NAKAMURA, Kiyohiro SHIKANO,
PDF Download Page	PDF download Page Link
Abstract(in Japanese)	(See Japanese page)
Abstract(in English)	It is very important for a hands-free speech interface to capture distant talking speech with high quality. A microphone array is an ideal candidate as an effective method for capturing distant talking speech. However, it is necessary to localize the target talker before capturing distant talking speech using a microphone array. In the conventional method of talker localization, it is difficult to estimate the target talker position accurately among localized sound sources, while the sound sources can be easily localized in a multiple sound source environment. To cope with this problem, we propose a talker localization algorithm by discriminating the sound sources using statistical speech and noise models based on HMMs (Hidden Marcov Models). First, the directions of signal arrival are estimated using a microphone array. Then, the desired sound signals are enhanced by steering the directivities to the estimated directions of signal arrival. Tha talker can be localized after discriminating between "speech" or "noise" from the desired sound signals using statistical speech and noise HMMs. In this paper, we evaluate the discrimination performance for the source position-known condition and position-unknown condition. The system recognizes the input from a sound source which is discriminated as being "speech" using statistical speech and noise HMMs. As a result, we confirm that the talker position is localized accurately because speech and noise can be discriminated efficiently in reverberant environments.
Keyword(in Japanese)	(See Japanese page)
Keyword(in English)	Microphone array / Sound source discrimination / HMM / Talker localization / Speech recognition / RWCP-DB
Paper #	NLC2000-32,SP2000-80
Date of Issue

Conference Information
Committee	NLC
Conference Date	2000/12/14(1days)
Place (in Japanese)	(See Japanese page)
Place (in English)
Topics (in Japanese)	(See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To	Natural Language Understanding and Models of Communication (NLC)
Language	JPN
Title (in Japanese)	(See Japanese page)
Sub Title (in Japanese)	(See Japanese page)
Title (in English)	Evaluation of Sound Source Discrimination Based on HMMs Using a Microphone Array
Sub Title (in English)
Keyword(1)	Microphone array
Keyword(2)	Sound source discrimination
Keyword(3)	HMM
Keyword(4)	Talker localization
Keyword(5)	Speech recognition
Keyword(6)	RWCP-DB
1st Author's Name	Takanobu NISHIURA
1st Author's Affiliation	ATR Spoken Language Translation Research Laboratories : Graduate School of Information Science, Nara Institute of Science and Technology()
2nd Author's Name	Satoshi NAKAMURA
2nd Author's Affiliation	ATR Spoken Language Translation Research Laboratories
3rd Author's Name	Kiyohiro SHIKANO
3rd Author's Affiliation	Graduate School of Information Science, Nara Institute of Science and Technology
Date	2000/12/14
Paper #	NLC2000-32,SP2000-80
Volume (vol)	vol.100
Number (no)	520
Page	pp.pp.-
#Pages	6
Date of Issue