Paper Abstract and Keywords |
Presentation |
2012-03-08 12:45
Word lip reading from scenes of speaker's utterance profile based on mouth-shape-code approach Shinsuke Okita, Yuki Sato, Yuki Sugata, Takuro Tasaka, Nozomu Hamada (Keio Univ.) CAS2011-112 SIP2011-132 CS2011-104 |
Abstract |
(in Japanese) |
(See Japanese page) |
(in English) |
In this paper, we apply the mouth-shape-code approach to the utterance profile of a Japanese speaker for lip reading. The novel point is the automatic detection of consonant key-frames. These key-frames are detected from the time series of a profile feature vector, which is defined from the difference values of the distance between the lips and the projection length of the lower lip. This approach extends the mouth-shape-code time series. Mouth-shape recognition of key-frames uses five profile shape features: the heights of the upper and lower lips, the projection lengths of the upper and lower lip points, and the angle of the lips. We apply DP matching to the recognized word code string of key-frames and each candidate word code string, then select the nearest word as the result. Recognition experiments were conducted on two word sets: 27 target words commonly used in daily conversation, and the same set with 10 pairs of similar words added. The proposed method attained recognition rates of 90.4% and 86.7% for these word sets, respectively. |
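The DP-matching step described in the abstract can be illustrated with a minimal sketch. It assumes that a word is represented as a sequence of mouth-shape-code symbols and that insertions, deletions, and substitutions each cost 1 (a plain edit distance). The abstract does not give the actual cost definition, and the code symbols and lexicon entries below are hypothetical placeholders rather than the paper's word set.

```python
from typing import Dict, Sequence, Tuple


def dp_distance(a: Sequence[str], b: Sequence[str]) -> int:
    """DP matching cost between two code sequences (unit-cost edit distance, an assumption)."""
    prev = list(range(len(b) + 1))  # cost of matching an empty prefix of a against b[:j]
    for i, x in enumerate(a, start=1):
        curr = [i]  # cost of matching a[:i] against an empty prefix of b
        for j, y in enumerate(b, start=1):
            curr.append(min(
                prev[j] + 1,             # delete x from a
                curr[j - 1] + 1,         # insert y into a
                prev[j - 1] + (x != y),  # match (cost 0) or substitute (cost 1)
            ))
        prev = curr
    return prev[-1]


def nearest_word(observed: Sequence[str],
                 lexicon: Dict[str, Sequence[str]]) -> Tuple[str, int]:
    """Return the candidate word whose code string is closest to the observed one."""
    return min(
        ((word, dp_distance(observed, codes)) for word, codes in lexicon.items()),
        key=lambda pair: pair[1],
    )


# Hypothetical lexicon: word labels and code symbols are placeholders.
demo_lexicon = {
    "word_a": ["A", "I", "U"],
    "word_b": ["A", "U"],
    "word_c": ["O", "E", "I", "A"],
}

if __name__ == "__main__":
    print(nearest_word(["A", "I", "O"], demo_lexicon))
```

In this sketch, ties between equally distant candidates are resolved by lexicon order; a real system would need an explicit tie-breaking rule or weighted costs.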
Keyword |
(in Japanese) |
(See Japanese page) |
(in English) |
lip reading / mouth-shape-code / key-frame / profile / image processing |
Reference Info. |
IEICE Tech. Rep., vol. 111, no. 466, SIP2011-132, pp. 31-36, March 2012. |
Paper # |
SIP2011-132 |
Date of Issue |
2012-03-01 (CAS, SIP, CS) |
ISSN |
Print edition: ISSN 0913-5685 Online edition: ISSN 2432-6380 |
Copyright and reproduction |
All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034) |
Download PDF |
CAS2011-112 SIP2011-132 CS2011-104 |
Conference Information |
Committee |
CAS CS SIP |
Conference Date |
2012-03-08 - 2012-03-09 |
Place (in Japanese) |
(See Japanese page) |
Place (in English) |
The University of Niigata |
Topics (in Japanese) |
(See Japanese page) |
Topics (in English) |
Network Processor, Signal Processing for communication, and Wireless LAN/PAN, etc. |
Paper Information |
Registration To |
SIP |
Conference Code |
2012-03-CAS-CS-SIP |
Language |
Japanese |
Title (in Japanese) |
(See Japanese page) |
Sub Title (in Japanese) |
(See Japanese page) |
Title (in English) |
Word lip reading from scenes of speaker's utterance profile based on mouth-shape-code approach |
Keyword(1) |
lip reading |
Keyword(2) |
mouth-shape-code |
Keyword(3) |
key-frame |
Keyword(4) |
profile |
Keyword(5) |
image processing |
1st Author's Name |
Shinsuke Okita |
1st Author's Affiliation |
Keio University (Keio Univ.) |
2nd Author's Name |
Yuki Sato |
2nd Author's Affiliation |
Keio University (Keio Univ.) |
3rd Author's Name |
Yuki Sugata |
3rd Author's Affiliation |
Keio University (Keio Univ.) |
4th Author's Name |
Takuro Tasaka |
4th Author's Affiliation |
Keio University (Keio Univ.) |
5th Author's Name |
Nozomu Hamada |
5th Author's Affiliation |
Keio University (Keio Univ.) |
Speaker |
Author-1 |
Date Time |
2012-03-08 12:45:00 |
Presentation Time |
25 minutes |
Registration for |
SIP |
Paper # |
CAS2011-112, SIP2011-132, CS2011-104 |
Volume (vol) |
vol.111 |
Number (no) |
no.465(CAS), no.466(SIP), no.467(CS) |
Page |
pp.31-36 |
#Pages |
6 |
|