自動読唇における分析フレーム間隔および画像解像度に関する調査

Presentation	2002/12/12 Relationship of Analysis Frame Interval and Image Resolution in Automatic Lip-Reading Recognition Performance Hidekazu KATO, Akinobu LEE, Hiroshi SARUWATARI, Kiyohiro SHIKANO,
PDF Download Page	PDF download Page Link
Abstract(in Japanese)	(See Japanese page)
Abstract(in English)	Automatic lip-reading research using the video sequence of the speaker's mouth has been carried out with significant interests in increasing the robustness of automatic speech recognition in noisy environments. However, it has not accomplished enough recognition rate yet. In this paper, we investigate the relationship of analysis frame interval and image resolution to check how they take effects on the lip-reading performance. Based on the experimental results under various analysis frame interval using the video sequence recorded by high speed camera, we make clean that it is effective to use the faster frame rate for high recognition performance. Another experimental results under various image resolution shows that the recognition performance does not depend on the image resolution. These results suggest that the visual feature vector extracted by our image based approach can reduce the resolution to 20 × 15 pix cells.
Keyword(in Japanese)	(See Japanese page)
Keyword(in English)	Automatic Lip-Reading / Analysis Frame Interval / Image Resolution / Image based Method
Paper #	SP2002-142
Date of Issue

Paper Information
Registration To	Speech (SP)
Language	JPN
Title (in Japanese)	(See Japanese page)
Sub Title (in Japanese)	(See Japanese page)
Title (in English)	Relationship of Analysis Frame Interval and Image Resolution in Automatic Lip-Reading Recognition Performance
Sub Title (in English)
Keyword(1)	Automatic Lip-Reading
Keyword(2)	Analysis Frame Interval
Keyword(3)	Image Resolution
Keyword(4)	Image based Method
1st Author's Name	Hidekazu KATO
1st Author's Affiliation	Graduate School of Information Science, Nara Institute of Science and Technology()
2nd Author's Name	Akinobu LEE
2nd Author's Affiliation	Graduate School of Information Science, Nara Institute of Science and Technology
3rd Author's Name	Hiroshi SARUWATARI
3rd Author's Affiliation	Graduate School of Information Science, Nara Institute of Science and Technology
4th Author's Name	Kiyohiro SHIKANO
4th Author's Affiliation	Graduate School of Information Science, Nara Institute of Science and Technology
Date	2002/12/12
Paper #	SP2002-142
Volume (vol)	vol.102
Number (no)	529
Page	pp.pp.-
#Pages	6
Date of Issue