Presentation | 2003/4/17 Fusing Audio and Video Information toward Detection of Speech Events under Real Environments Takashi YOSHIMURA, Futoshi ASANO, Youichi MOTOMURA, Hideki ASOH, Naoyuki ICHIMURA, Kiyoshi YAMAMOTO, Satoshi NAKAMURA |
---|---|
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | This paper proposes a method for detecting and separating speech events under multiple-sound-source conditions using audio and video information. To detect speech events, sound localization using a microphone array and human tracking by stereo vision are combined by a Bayesian network. From the inference results of the Bayesian network, the time and location of speech events can be determined even when multiple sound sources are present. Based on the detected speech event information, a maximum likelihood adaptive beamformer is constructed and the speech signal is separated from background noise and interference. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | Sound Localization / Human Tracking / Information Fusion / Bayesian Network |
Paper # | EA2003-3,SP2003-3 |
Date of Issue |
Conference Information | |
Committee | SP |
---|---|
Conference Date | 2003/4/17 (1 day)
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Speech (SP) |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Fusing Audio and Video Information toward Detection of Speech Events under Real Environments |
Sub Title (in English) | |
Keyword(1) | Sound Localization |
Keyword(2) | Human Tracking |
Keyword(3) | Information Fusion |
Keyword(4) | Bayesian Network |
1st Author's Name | Takashi YOSHIMURA |
1st Author's Affiliation | AIST
2nd Author's Name | Futoshi ASANO |
2nd Author's Affiliation | AIST |
3rd Author's Name | Youichi MOTOMURA |
3rd Author's Affiliation | AIST |
4th Author's Name | Hideki ASOH |
4th Author's Affiliation | AIST |
5th Author's Name | Naoyuki ICHIMURA |
5th Author's Affiliation | AIST |
6th Author's Name | Kiyoshi YAMAMOTO |
6th Author's Affiliation | University of Tsukuba |
7th Author's Name | Satoshi NAKAMURA |
7th Author's Affiliation | ATR Spoken Language Translation Research Laboratories |
Date | 2003/4/17 |
Paper # | EA2003-3,SP2003-3 |
Volume (vol) | vol.103 |
Number (no) | 26 |
Page | pp.-
#Pages | 6 |
Date of Issue |