音素・口形素の特徴表現と敵対的生成ネットワークによる発話顔生成

Presentation	2019-03-18 Talking Head Generation with Deep Phoneme and Viseme Representation and Generative Adversarial Networks Takaaki Yasui, Yuta Nakashima, Noboru Babaguchi,
PDF Download Page	PDF download Page Link
Abstract(in Japanese)	(See Japanese page)
Abstract(in English)	In this paper, we propose to generate talking head given an audio input.Some existing methods generate photorealistic talking head by collecting a large amount of the target speaker's utterance video, but these methods are not applicable when only a small amount of video is available.Our method generates talking head for arbitrary speech of the target speaker with a small amount of utterance video.For doing this, we firstly extract phoneme features and viseme features from utterance video and map these features into a common space.We then train a generative adversarial network (GAN) to generate talking head from the phoneme feature on the common space.These networks are trained with less than 2 min of utterance video.
Keyword(in Japanese)	(See Japanese page)
Keyword(in English)	talking head generation / phoneme / viseme / generative adversarial network
Paper #	BioX2018-53,PRMU2018-157
Date of Issue	2019-03-10 (BioX, PRMU)

Conference Information
Committee	PRMU / BioX
Conference Date	2019/3/17(2days)
Place (in Japanese)	(See Japanese page)
Place (in English)
Topics (in Japanese)	(See Japanese page)
Topics (in English)
Chair	Shinichi Sato(NII) / Kazuhiko Sumi(AGU)
Vice Chair	Yoshihisa Ijiri(Omron) / Toru Tamaki(Hiroshima Univ.) / Hitoshi Imaoka(NEC) / Tetsushi Ohki(Shizuoka Univ.)
Secretary	Yoshihisa Ijiri(NEC) / Toru Tamaki(Osaka Univ.) / Hitoshi Imaoka(Fujitsu Labs.) / Tetsushi Ohki(Univ. of Electro-Comm.)
Assistant	Go Irie(NTT) / Yoshitaka Ushiku(Univ. of Tokyo) / Norihiro Okui(KDDI Research) / Daishi Watabe(Saitama Inst. of Tech.)

Paper Information
Registration To	Technical Committee on Pattern Recognition and Media Understanding / Technical Committee on Biometrics
Language	JPN
Title (in Japanese)	(See Japanese page)
Sub Title (in Japanese)	(See Japanese page)
Title (in English)	Talking Head Generation with Deep Phoneme and Viseme Representation and Generative Adversarial Networks
Sub Title (in English)
Keyword(1)	talking head generation
Keyword(2)	phoneme
Keyword(3)	viseme
Keyword(4)	generative adversarial network
1st Author's Name	Takaaki Yasui
1st Author's Affiliation	Osaka University(Osaka Univ.)
2nd Author's Name	Yuta Nakashima
2nd Author's Affiliation	Osaka University(Osaka Univ.)
3rd Author's Name	Noboru Babaguchi
3rd Author's Affiliation	Osaka University(Osaka Univ.)
Date	2019-03-18
Paper #	BioX2018-53,PRMU2018-157
Volume (vol)	vol.118
Number (no)	BioX-512,PRMU-513
Page	pp.pp.143-148(BioX), pp.143-148(PRMU),
#Pages	6
Date of Issue	2019-03-10 (BioX, PRMU)