Presentation 2020-03-05
A study on image captioning considering its imageability
Kazuki Umemura, Marc Aurel Kastner, Ichiro Ide, Yasutomo Kawanishi, Takatsugu Hirayama, Keisuke Doman, Daisuke Deguchi, Hiroshi Murase,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) We propose an imageability-aware image captioning method tailoring generated captions to various applications. In this study, we first extend an existing image captioning dataset by augmenting its captions. Then, an imageability score for each augmented caption is calculated. A modified image captioning model is trained using this extended dataset to generate captions tailored to a specified imageability score. The evaluation shows the possibility that the extended dataset and the proposed method can generate imageability-aware captions.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) Multimedia proccesing / image captioning / psycholinguistics / semantic gap
Paper # IMQ2019-48,IE2019-130,MVE2019-69
Date of Issue 2020-02-27 (IMQ, IE, MVE)

Conference Information
Committee IE / IMQ / MVE / CQ
Conference Date 2020/3/5(2days)
Place (in Japanese) (See Japanese page)
Place (in English) Kyushu Institute of Technology
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair Hideaki Kimata(NTT) / Toshiya Nakaguchi(Chiba Univ.) / Kenji Mase(Nagoya Univ.) / Hideyuki Shimonishi(NEC)
Vice Chair Kazuya Kodama(NII) / Keita Takahashi(Nagoya Univ.) / Mitsuru Maeda(Canon) / Kenya Uomori(Osaka Univ.) / Masayuki Ihara(NTT) / Jun Okamoto(NTT) / Takefumi Hiraguri(Nippon Inst. of Tech.)
Secretary Kazuya Kodama(NTT) / Keita Takahashi(NHK) / Mitsuru Maeda(Shizuoka Univ.) / Kenya Uomori(Sony Semiconductor Solutions) / Masayuki Ihara(Nagoya Univ.) / Jun Okamoto(NTT) / Takefumi Hiraguri(Nippon Inst. of Tech.)
Assistant Kyohei Unno(KDDI Research) / Norishige Fukushima(Nagoya Inst. of Tech.) / Hiroaki Kudo(Nagoya Univ.) / Masaru Tsuchida(NTT) / Keita Hirai(Chiba Univ.) / Satoshi Nishiguchi(Oosaka Inst. of Tech.) / Masanori Yokoyama(NTT) / Shogo Fukushima(Univ. of ToKyo) / Chikara Sasaki(KDDI Research) / Yoshiaki Nishikawa(NEC) / Takuto Kimura(NTT)

Paper Information
Registration To Technical Committee on Image Engineering / Technical Committee on Image Media Quality / Technical Committee on Media Experience and Virtual Environment / Technical Committee on Communication Quality
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) A study on image captioning considering its imageability
Sub Title (in English)
Keyword(1) Multimedia proccesing
Keyword(2) image captioning
Keyword(3) psycholinguistics
Keyword(4) semantic gap
1st Author's Name Kazuki Umemura
1st Author's Affiliation Nagoya University(Nagoya Univ.)
2nd Author's Name Marc Aurel Kastner
2nd Author's Affiliation Nagoya University(Nagoya Univ.)
3rd Author's Name Ichiro Ide
3rd Author's Affiliation Nagoya University(Nagoya Univ.)
4th Author's Name Yasutomo Kawanishi
4th Author's Affiliation Nagoya University(Nagoya Univ.)
5th Author's Name Takatsugu Hirayama
5th Author's Affiliation Nagoya University(Nagoya Univ.)
6th Author's Name Keisuke Doman
6th Author's Affiliation Chukyo University(Chukyo Univ.)
7th Author's Name Daisuke Deguchi
7th Author's Affiliation Nagoya University(Nagoya Univ.)
8th Author's Name Hiroshi Murase
8th Author's Affiliation Nagoya University(Nagoya Univ.)
Date 2020-03-05
Paper # IMQ2019-48,IE2019-130,MVE2019-69
Volume (vol) vol.119
Number (no) IMQ-454,IE-456,MVE-457
Page pp.pp.165-169(IMQ), pp.165-169(IE), pp.165-169(MVE),
#Pages 5
Date of Issue 2020-02-27 (IMQ, IE, MVE)