周波数領域両耳聴モデルによる音源方向推定と音源分離(音声・音響情報システム及び一般)

中島 栄俊; 苣木 禎史; 宇佐川 毅

Presentation	2004/9/10 DOA estimation and speech signal segregation based on frequency domain binaural model Hidetoshi NAKASHIMA, Yoshifumi CHISAKI, Tsuyoshi USAGAWA,
PDF Download Page	PDF download Page Link
Abstract(in Japanese)	(See Japanese page)
Abstract(in English)	As known as a "Cocktail Party Effect", we can communicate others under noisy environments. This effect is based on binaural functions and the human segregates the specific sound by using directional information as a cue of the sound. The computational model for cocktail party effect has been studied, we also proposed it as called "Frequency Domain Binaural Model (FDBM)" which has some characteristics such as less computational load, high segregation quality, and the keep the binaural information of the segregated sound. In this paper, the basic algorithm of FDBM and its performance for segregation obtained by the computer simulations are addressed. According to the evaluation as a speech enhancer, the envelope of the segregated signal is recovered and quite similar to the one of the target signal. On the other hand, more than 90% recognition rates are obtained in speech recognition task, when the azimuth of reception of the target signal and noise differs by 10°.
Keyword(in Japanese)	(See Japanese page)
Keyword(in English)	Binaural Model / Frequency Domain / Interaural Phase Difference / Interaural Level Difference / Head-Related Transfer Function / Sound Segregation / Direction of Arrival Estimation
Paper #	EA2004-71,SIP2004-75,SIS2004-42
Date of Issue

Conference Information
Committee	SIS
Conference Date	2004/9/10(1days)
Place (in Japanese)	(See Japanese page)
Place (in English)
Topics (in Japanese)	(See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To	Smart Info-Media Systems (SIS)
Language	JPN
Title (in Japanese)	(See Japanese page)
Sub Title (in Japanese)	(See Japanese page)
Title (in English)	DOA estimation and speech signal segregation based on frequency domain binaural model
Sub Title (in English)
Keyword(1)	Binaural Model
Keyword(2)	Frequency Domain
Keyword(3)	Interaural Phase Difference
Keyword(4)	Interaural Level Difference
Keyword(5)	Head-Related Transfer Function
Keyword(6)	Sound Segregation
Keyword(7)	Direction of Arrival Estimation
1st Author's Name	Hidetoshi NAKASHIMA
1st Author's Affiliation	Kumamoto National College of Technology()
2nd Author's Name	Yoshifumi CHISAKI
2nd Author's Affiliation	Faculty of Engineering, Kumamoto University
3rd Author's Name	Tsuyoshi USAGAWA
3rd Author's Affiliation	Faculty of Engineering, Kumamoto University
Date	2004/9/10
Paper #	EA2004-71,SIP2004-75,SIS2004-42
Volume (vol)	vol.104
Number (no)	308
Page	pp.pp.-
#Pages	6
Date of Issue