Presentation | 2010/1/14 Multimodal speech recognition using multimodal voice activity detection Satoshi TAMURA, Masato ISHIKAWA, Takashi HASHIBA, Shin'ichi TAKEUCHI, Satoru HAYAMIZU, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | Audio-Visual Automatic Speech Recognition (AVASR) has been developed to enhance the robustness in noisy environments, using visual information in addition to acoustic features. Similarly, Audio-Visual Voice Activity Detection (AVVAD) has been investigated and used to increase the precision of VAD, since detecting presence of speech in noisy audio signals contributes ASR performance. In this paper, we propose a novel speech recognition method combining AVASR and AVVAD: combinations of model-based and model-free, and feature-fusion-based or decision-fusion-based methods. To evaluate the proposed schemes, recognition experiments were conducted using noisy audio-visual data. Then it is found that the proposed method using the model-free feature-fusion AVVAD method outperforms not only audio-only ASR but also conventional AVASR. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | multimodal / speech recognition / voice activity detection / feature fusion / decision fusion |
Paper # | CQ2009-105,PRMU2009-204,SP2000-145,MVE2009-127 |
Date of Issue |
Conference Information | |
Committee | CQ |
---|---|
Conference Date | 2010/1/14(1days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Communication Quality (CQ) |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Multimodal speech recognition using multimodal voice activity detection |
Sub Title (in English) | |
Keyword(1) | multimodal |
Keyword(2) | speech recognition |
Keyword(3) | voice activity detection |
Keyword(4) | feature fusion |
Keyword(5) | decision fusion |
1st Author's Name | Satoshi TAMURA |
1st Author's Affiliation | Faculty of Engineering, Gifu University() |
2nd Author's Name | Masato ISHIKAWA |
2nd Author's Affiliation | Graduated School of Engineering, Gifu University |
3rd Author's Name | Takashi HASHIBA |
3rd Author's Affiliation | Graduated School of Engineering, Gifu University |
4th Author's Name | Shin'ichi TAKEUCHI |
4th Author's Affiliation | Virtual System Laboratory, Gifu University |
5th Author's Name | Satoru HAYAMIZU |
5th Author's Affiliation | Faculty of Engineering, Gifu University |
Date | 2010/1/14 |
Paper # | CQ2009-105,PRMU2009-204,SP2000-145,MVE2009-127 |
Volume (vol) | vol.109 |
Number (no) | 373 |
Page | pp.pp.- |
#Pages | 6 |
Date of Issue |