Presentation | 2006-03-22 Omni-directional Estimation of Sound Source Location and Its Application to Estimation of Speaker's Position by Combining with Skin-color Information Satoshi TAKAHASHI, Jun-ichi IMAI, Masahide KANEKO, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | A user doesn't always stand in front of a robot and may sometimes call a robot from its back. Therefore a robot should know the user's position first to start communication with him/her. This paper proposes the omni-directional estimation method of speaker's position using the combination of audio and color information. Estimation of the position of sound source is carried out to calculate the difference of arrival time from the sound source to multi-channel microphones. Number of microphones and their optimal arrangement are derived considering the accuracy and processing time for 3-D estimation of sound source position. Next the robust detection of skin-color region is carried out by combining a trained GMM (Gaussian Mixture Model) for input scene with a general GMM. Bayesian network is employed to combine the result of sound source estimation and detection of skin-color region, and to realize a highly accurate estimation of speaker's location. Experimental results are shown to demonstrate the usefulness of the proposed methods. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | estimation of sound source location / omni-direction / speaker's location / skin-color information / CSP method / Gaussian mixture model |
Paper # | MVE2005-70 |
Date of Issue |
Conference Information | |
Committee | MVE |
---|---|
Conference Date | 2006/3/15(1days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Media Experience and Virtual Environment (MVE) |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Omni-directional Estimation of Sound Source Location and Its Application to Estimation of Speaker's Position by Combining with Skin-color Information |
Sub Title (in English) | |
Keyword(1) | estimation of sound source location |
Keyword(2) | omni-direction |
Keyword(3) | speaker's location |
Keyword(4) | skin-color information |
Keyword(5) | CSP method |
Keyword(6) | Gaussian mixture model |
1st Author's Name | Satoshi TAKAHASHI |
1st Author's Affiliation | Graduate School of Electro-Communications, The University of Eletro-Communications() |
2nd Author's Name | Jun-ichi IMAI |
2nd Author's Affiliation | Graduate School of Electro-Communications, The University of Eletro-Communications |
3rd Author's Name | Masahide KANEKO |
3rd Author's Affiliation | Graduate School of Electro-Communications, The University of Eletro-Communications |
Date | 2006-03-22 |
Paper # | MVE2005-70 |
Volume (vol) | vol.105 |
Number (no) | 683 |
Page | pp.pp.- |
#Pages | 6 |
Date of Issue |