AAMを用いた唇領域特徴による音声発話認識(一般セッション,クロスモーダル)

Presentation	2010-01-22 Speech Recognition Based on Lip Area Feature Captured by AAM Yuto KOMAI, Chikoto MIYAMOTO, Tetsuya TAKIGUCHI, Yasuo TARIKI,
PDF Download Page	PDF download Page Link
Abstract(in Japanese)	(See Japanese page)
Abstract(in English)	As one of the techniques for robust speech recognition under the noise environment, multimodal speech. recognition using lip dynamic scene information together with audio information is attracting attention and the research is advanced in recent years. Since audio information together with visual information plays a great role in multimodal speech recognition, image features you use becomes a significant point. As for the visual features, various features have been proposed because of the difference of the extraction methods while the feature such as MFCC is used to a certain degree for audio features so far. This paper proposes, for spoken word recognition, to utilize c combined parameter extracted by Active Appearance Model applied to a face image including the lip area. Active Appearance Model contains information of the coordinate value and the brightness value as the image feature.
Keyword(in Japanese)	(See Japanese page)
Keyword(in English)	Lip area / Active Appearance Model / combined parameter / integration of audio and visual
Paper #	CQ2009-107,PRMU2009-206,SP2009-147,MVE2009-129
Date of Issue

Paper Information
Registration To	Communication Quality (CQ)
Language	JPN
Title (in Japanese)	(See Japanese page)
Sub Title (in Japanese)	(See Japanese page)
Title (in English)	Speech Recognition Based on Lip Area Feature Captured by AAM
Sub Title (in English)
Keyword(1)	Lip area
Keyword(2)	Active Appearance Model
Keyword(3)	combined parameter
Keyword(4)	integration of audio and visual
1st Author's Name	Yuto KOMAI
1st Author's Affiliation	Department of Computer and System Engineering, Faculty of Engineering, Kobe University()
2nd Author's Name	Chikoto MIYAMOTO
2nd Author's Affiliation	Graduate School of Engineering, Kobe University
3rd Author's Name	Tetsuya TAKIGUCHI
3rd Author's Affiliation	Organization of Advanced Science and Technology, Kobe University
4th Author's Name	Yasuo TARIKI
4th Author's Affiliation	Organization of Advanced Science and Technology, Kobe University
Date	2010-01-22
Paper #	CQ2009-107,PRMU2009-206,SP2009-147,MVE2009-129
Volume (vol)	vol.109
Number (no)	373
Page	pp.pp.-
#Pages	6
Date of Issue