Presentation | 2002/12/12 Relationship of Analysis Frame Interval and Image Resolution in Automatic Lip-Reading Recognition Performance Hidekazu KATO, Akinobu LEE, Hiroshi SARUWATARI, Kiyohiro SHIKANO, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | Automatic lip-reading research using the video sequence of the speaker's mouth has been carried out with significant interests in increasing the robustness of automatic speech recognition in noisy environments. However, it has not accomplished enough recognition rate yet. In this paper, we investigate the relationship of analysis frame interval and image resolution to check how they take effects on the lip-reading performance. Based on the experimental results under various analysis frame interval using the video sequence recorded by high speed camera, we make clean that it is effective to use the faster frame rate for high recognition performance. Another experimental results under various image resolution shows that the recognition performance does not depend on the image resolution. These results suggest that the visual feature vector extracted by our image based approach can reduce the resolution to 20 × 15 pix cells. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | Automatic Lip-Reading / Analysis Frame Interval / Image Resolution / Image based Method |
Paper # | SP2002-142 |
Date of Issue |
Conference Information | |
Committee | SP |
---|---|
Conference Date | 2002/12/12(1days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Speech (SP) |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Relationship of Analysis Frame Interval and Image Resolution in Automatic Lip-Reading Recognition Performance |
Sub Title (in English) | |
Keyword(1) | Automatic Lip-Reading |
Keyword(2) | Analysis Frame Interval |
Keyword(3) | Image Resolution |
Keyword(4) | Image based Method |
1st Author's Name | Hidekazu KATO |
1st Author's Affiliation | Graduate School of Information Science, Nara Institute of Science and Technology() |
2nd Author's Name | Akinobu LEE |
2nd Author's Affiliation | Graduate School of Information Science, Nara Institute of Science and Technology |
3rd Author's Name | Hiroshi SARUWATARI |
3rd Author's Affiliation | Graduate School of Information Science, Nara Institute of Science and Technology |
4th Author's Name | Kiyohiro SHIKANO |
4th Author's Affiliation | Graduate School of Information Science, Nara Institute of Science and Technology |
Date | 2002/12/12 |
Paper # | SP2002-142 |
Volume (vol) | vol.102 |
Number (no) | 529 |
Page | pp.pp.- |
#Pages | 6 |
Date of Issue |