Presentation | 2010-01-22 Speech Recognition Based on Lip Area Feature Captured by AAM Yuto KOMAI, Chikoto MIYAMOTO, Tetsuya TAKIGUCHI, Yasuo TARIKI, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | As one of the techniques for robust speech recognition under the noise environment, multimodal speech. recognition using lip dynamic scene information together with audio information is attracting attention and the research is advanced in recent years. Since audio information together with visual information plays a great role in multimodal speech recognition, image features you use becomes a significant point. As for the visual features, various features have been proposed because of the difference of the extraction methods while the feature such as MFCC is used to a certain degree for audio features so far. This paper proposes, for spoken word recognition, to utilize c combined parameter extracted by Active Appearance Model applied to a face image including the lip area. Active Appearance Model contains information of the coordinate value and the brightness value as the image feature. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | Lip area / Active Appearance Model / combined parameter / integration of audio and visual |
Paper # | CQ2009-107,PRMU2009-206,SP2009-147,MVE2009-129 |
Date of Issue |
Conference Information | |
Committee | CQ |
---|---|
Conference Date | 2010/1/14(1days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Communication Quality (CQ) |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Speech Recognition Based on Lip Area Feature Captured by AAM |
Sub Title (in English) | |
Keyword(1) | Lip area |
Keyword(2) | Active Appearance Model |
Keyword(3) | combined parameter |
Keyword(4) | integration of audio and visual |
1st Author's Name | Yuto KOMAI |
1st Author's Affiliation | Department of Computer and System Engineering, Faculty of Engineering, Kobe University() |
2nd Author's Name | Chikoto MIYAMOTO |
2nd Author's Affiliation | Graduate School of Engineering, Kobe University |
3rd Author's Name | Tetsuya TAKIGUCHI |
3rd Author's Affiliation | Organization of Advanced Science and Technology, Kobe University |
4th Author's Name | Yasuo TARIKI |
4th Author's Affiliation | Organization of Advanced Science and Technology, Kobe University |
Date | 2010-01-22 |
Paper # | CQ2009-107,PRMU2009-206,SP2009-147,MVE2009-129 |
Volume (vol) | vol.109 |
Number (no) | 373 |
Page | pp.pp.- |
#Pages | 6 |
Date of Issue |