Cross-modal CycleGAN for Low-Resource Anime Style Face Translation

Shiping Deng; 内田 薫

Presentation	2021-10-09 Cross-modal CycleGAN for Low-Resource Anime Style Face Translation Shiping Deng, Kaoru Uchida,
PDF Download Page	PDF download Page Link
Abstract(in Japanese)	(See Japanese page)
Abstract(in English)	Human face to anime face translation has attracted the attention of many researchers in recentyears, and various works have achieved high-quality style transfer on conventional tasks. However, existing works often have fatal shortcomings when the target domain training data is heavilyinsufficient, which is named as imbalanced (low-resource) setting. Here the low-resource task, generally means there is no sufficient data on the training dataset compared with the conventionaltask, e.g. the training data size is fewer than 100. To solve this problem, we propose a multimodallow-resource translation model for a specific style. Based on the cyclic adversarial networkand class activation map, we import semantic modality to enhance data information and attentionmodules, which will help our model focus more on the important areas distinguishing the sourcedomain from the target domain. Unlike the previous unsupervised learning of single modality, ourmodel successfully completes image translation in cross-modal situations by importing pre-trainedtext-image alignment model. In addition, the use of an asymmetric structure speeds up training andflexibly generates images of the target domain. The experimental results show that our method hassuperiority in low-resource settings compared with the existing work of the same type.
Keyword(in Japanese)	(See Japanese page)
Keyword(in English)	Data-imbalanced / style translation / deep neural network
Paper #	PRMU2021-18
Date of Issue	2021-10-01 (PRMU)

Conference Information
Committee	PRMU
Conference Date	2021/10/8(2days)
Place (in Japanese)	(See Japanese page)
Place (in English)	Online
Topics (in Japanese)	(See Japanese page)
Topics (in English)	Processes and technologies to make research more efficient
Chair	Seiichi Uchida(Kyushu Univ.)
Vice Chair	Masakazu Iwamura(Osaka Pref. Univ.) / Mitsuru Anpai(Denso IT Lab.)
Secretary	Masakazu Iwamura(NTT) / Mitsuru Anpai(Tottori Univ.)
Assistant	Kouta Yamaguchi(CyberAgent) / Yusuke Matsui(Univ. of Tokyo)

Paper Information
Registration To	Technical Committee on Pattern Recognition and Media Understanding
Language	ENG
Title (in Japanese)	(See Japanese page)
Sub Title (in Japanese)	(See Japanese page)
Title (in English)	Cross-modal CycleGAN for Low-Resource Anime Style Face Translation
Sub Title (in English)
Keyword(1)	Data-imbalanced
Keyword(2)	style translation
Keyword(3)	deep neural network
1st Author's Name	Shiping Deng
1st Author's Affiliation	Hosei University/University of Science and Technology of China(Hosei Univ./USTC)
2nd Author's Name	Kaoru Uchida
2nd Author's Affiliation	Hosei University(Hosei Univ.)
Date	2021-10-09
Paper #	PRMU2021-18
Volume (vol)	vol.121
Number (no)	PRMU-192
Page	pp.pp.11-16(PRMU),
#Pages	6
Date of Issue	2021-10-01 (PRMU)