DETECTION OF SPEECH SECTIONS FROM ACOUSTIC SIGNALS (Content Processing and Information Security)(International Workshop On Advanced Image Technology (IWAIT2004))

講演名	2004/1/5 DETECTION OF SPEECH SECTIONS FROM ACOUSTIC SIGNALS (Content Processing and Information Security)(International Workshop On Advanced Image Technology (IWAIT2004)) ,
PDFダウンロードページ	PDFダウンロードページへ
抄録(和)
抄録(英)	We study a method to detect speech sections from acoustic signals using lip image sequences. This method can reduce mistakes that the non-target person's speeches are recognized as those of the target person, since the lip images of the target person can be used. The proposed method employs the hidden Markov models to learn the characteristics of the speech and the non-speech sections in the lip image sequences. Especially, to reduce the ambiguities caused by the variations of the appearance of the lips in the images and of the brightness of the images, we examine the construction of the features, which are extracted from the images and are used to detect speech sections.
キーワード(和)
キーワード(英)
資料番号	IE2003-149
発行日

講演論文情報詳細
申込み研究会	Image Engineering (IE)
本文の言語	ENG
タイトル（和）
サブタイトル（和）
タイトル（英）	DETECTION OF SPEECH SECTIONS FROM ACOUSTIC SIGNALS (Content Processing and Information Security)(International Workshop On Advanced Image Technology (IWAIT2004))
サブタイトル（和）
キーワード(1)（和/英）
第 1 著者氏名（和/英）	/ Hidenori Terasawa
第 1 著者所属（和/英）	Graduate School of Engineering, Tokyo Metropolitan University
発表年月日	2004/1/5
資料番号	IE2003-149
巻番号（vol）	vol.103
号番号（no）	539
ページ範囲	pp.-
ページ数	5
発行日