Paper Abstract and Keywords |
Presentation |
2022-03-04 09:45
A Study on Silent Word Recognition Based on Deep Learning Using Facial 3D Model
Ryuji Wada, Kenko Ota (NIT)
MICT2021-103 |
Abstract |
(in English) |
This study proposes a silent word recognition method that removes the constraint on face orientation, and evaluates its effectiveness. In recent years, deep-learning-based lip reading from mouth images has advanced. Camera-based methods, however, often assume that the distance, position, and orientation of the face relative to the camera are fixed, and it is difficult to hold the face in a fixed orientation when a lip-reading system is used in an online conference. We therefore used a 3D model to create utterance videos in which the face is turned in various orientations, and used these videos as training data for deep learning. In the performance evaluation experiment, recognition performance improved compared with the case where the 3D model was not used. |
Keyword |
(in English) |
3D model / deep learning / lipreading |
Reference Info. |
IEICE Tech. Rep., vol. 121, no. 404, MICT2021-103, pp. 13-18, March 2022. |
Paper # |
MICT2021-103 |
Date of Issue |
2022-02-25 (MICT) |
ISSN |
Online edition: ISSN 2432-6380 |
Copyright and reproduction |
All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034) |
Conference Information |
Committee |
MICT EMCJ |
Conference Date |
2022-03-04 - 2022-03-04 |
Place (in English) |
Online |
Topics (in English) |
Healthcare and Medical Information Communication Technologies, EMC, etc. |
Paper Information |
Registration To |
MICT |
Conference Code |
2022-03-MICT-EMCJ |
Language |
Japanese |
Title (in English) |
A Study on Silent Word Recognition Based on Deep Learning Using Facial 3D Model |
Keyword(1) |
3D model |
Keyword(2) |
deep learning |
Keyword(3) |
lipreading |
1st Author's Name |
Ryuji Wada |
1st Author's Affiliation |
Nippon Institute of Technology (NIT) |
2nd Author's Name |
Kenko Ota |
2nd Author's Affiliation |
Nippon Institute of Technology (NIT) |
Speaker |
Author-1 |
Date Time |
2022-03-04 09:45:00 |
Presentation Time |
20 minutes |
Registration for |
MICT |
Volume (vol) |
vol.121 |
Number (no) |
no.404 |
Page |
pp.13-18 |
#Pages |
6 |
|