Presentation 2023-03-02
A Study of Word Lip-Reading using Meta Learnin
Michinari Kodama, Takeshi Saitoh,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) Lip-reading technology, which estimates utterance content using only visual information, is a kind of supervised learning, and a large-scale data set is desired. However, collecting utterance scenes is costly. Therefore, in this paper, in order to reduce the collection cost, we consider a method that uses meta learning in the approach of learning with a small number of data. Recognition experiments were conducted using several meta learning methods such as ProtoNet and DeepBDC using three datasets: public datasets LRW and SSSD for lip-reading, and public action recognition dataset UCF101 for comparison. As a result, compared to UCF101, LRW and SSSD had lower recognition accuracy. In this paper, we report the experimental results.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) Few-shot learning / meta learning / lip-reading / word
Paper # PRMU2022-77,IBISML2022-84
Date of Issue 2023-02-23 (PRMU, IBISML)

Conference Information
Committee PRMU / IBISML / IPSJ-CVIM
Conference Date 2023/3/2(2days)
Place (in Japanese) (See Japanese page)
Place (in English) Future University Hakodate
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair Seiichi Uchida(Kyushu Univ.) / Masashi Sugiyama(Univ. of Tokyo)
Vice Chair Takuya Funatomi(NAIST) / Mitsuru Anpai(Denso IT Lab.) / Toshihiro Kamishima(AIST) / Koji Tsuda(Univ. of Tokyo)
Secretary Takuya Funatomi(CyberAgent) / Mitsuru Anpai(Univ. of Tokyo) / Toshihiro Kamishima(NTT) / Koji Tsuda(Hokkaido Univ.)
Assistant Nakamasa Inoue(Tokyo Inst. of Tech.) / Yasutomo Kawanishi(Riken) / Yoshinobu Kawahara(Osaka Univ.) / Taiji Suzuki(Tokyo Inst. of Tech.)

Paper Information
Registration To Technical Committee on Pattern Recognition and Media Understanding / Technical Committee on Information-Based Induction Sciences and Machine Learning / Special Interest Group on Computer Vision and Image Media
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) A Study of Word Lip-Reading using Meta Learnin
Sub Title (in English)
Keyword(1) Few-shot learning
Keyword(2) meta learning
Keyword(3) lip-reading
Keyword(4) word
1st Author's Name Michinari Kodama
1st Author's Affiliation Kyushu Institute of Technology(kyutech)
2nd Author's Name Takeshi Saitoh
2nd Author's Affiliation Kyushu Institute of Technology(kyutech)
Date 2023-03-02
Paper # PRMU2022-77,IBISML2022-84
Volume (vol) vol.122
Number (no) PRMU-404,IBISML-405
Page pp.pp.102-106(PRMU), pp.102-106(IBISML),
#Pages 5
Date of Issue 2023-02-23 (PRMU, IBISML)