Presentation | 2023-03-02 A Study of Word Lip-Reading using Meta Learnin Michinari Kodama, Takeshi Saitoh, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | Lip-reading technology, which estimates utterance content using only visual information, is a kind of supervised learning, and a large-scale data set is desired. However, collecting utterance scenes is costly. Therefore, in this paper, in order to reduce the collection cost, we consider a method that uses meta learning in the approach of learning with a small number of data. Recognition experiments were conducted using several meta learning methods such as ProtoNet and DeepBDC using three datasets: public datasets LRW and SSSD for lip-reading, and public action recognition dataset UCF101 for comparison. As a result, compared to UCF101, LRW and SSSD had lower recognition accuracy. In this paper, we report the experimental results. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | Few-shot learning / meta learning / lip-reading / word |
Paper # | PRMU2022-77,IBISML2022-84 |
Date of Issue | 2023-02-23 (PRMU, IBISML) |
Conference Information | |
Committee | PRMU / IBISML / IPSJ-CVIM |
---|---|
Conference Date | 2023/3/2(2days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | Future University Hakodate |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | Seiichi Uchida(Kyushu Univ.) / Masashi Sugiyama(Univ. of Tokyo) |
Vice Chair | Takuya Funatomi(NAIST) / Mitsuru Anpai(Denso IT Lab.) / Toshihiro Kamishima(AIST) / Koji Tsuda(Univ. of Tokyo) |
Secretary | Takuya Funatomi(CyberAgent) / Mitsuru Anpai(Univ. of Tokyo) / Toshihiro Kamishima(NTT) / Koji Tsuda(Hokkaido Univ.) |
Assistant | Nakamasa Inoue(Tokyo Inst. of Tech.) / Yasutomo Kawanishi(Riken) / Yoshinobu Kawahara(Osaka Univ.) / Taiji Suzuki(Tokyo Inst. of Tech.) |
Paper Information | |
Registration To | Technical Committee on Pattern Recognition and Media Understanding / Technical Committee on Information-Based Induction Sciences and Machine Learning / Special Interest Group on Computer Vision and Image Media |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | A Study of Word Lip-Reading using Meta Learnin |
Sub Title (in English) | |
Keyword(1) | Few-shot learning |
Keyword(2) | meta learning |
Keyword(3) | lip-reading |
Keyword(4) | word |
1st Author's Name | Michinari Kodama |
1st Author's Affiliation | Kyushu Institute of Technology(kyutech) |
2nd Author's Name | Takeshi Saitoh |
2nd Author's Affiliation | Kyushu Institute of Technology(kyutech) |
Date | 2023-03-02 |
Paper # | PRMU2022-77,IBISML2022-84 |
Volume (vol) | vol.122 |
Number (no) | PRMU-404,IBISML-405 |
Page | pp.pp.102-106(PRMU), pp.102-106(IBISML), |
#Pages | 5 |
Date of Issue | 2023-02-23 (PRMU, IBISML) |