Presentation | 2020-03-05 A study on image captioning considering its imageability Kazuki Umemura, Marc Aurel Kastner, Ichiro Ide, Yasutomo Kawanishi, Takatsugu Hirayama, Keisuke Doman, Daisuke Deguchi, Hiroshi Murase, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | We propose an imageability-aware image captioning method tailoring generated captions to various applications. In this study, we first extend an existing image captioning dataset by augmenting its captions. Then, an imageability score for each augmented caption is calculated. A modified image captioning model is trained using this extended dataset to generate captions tailored to a specified imageability score. The evaluation shows the possibility that the extended dataset and the proposed method can generate imageability-aware captions. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | Multimedia proccesing / image captioning / psycholinguistics / semantic gap |
Paper # | IMQ2019-48,IE2019-130,MVE2019-69 |
Date of Issue | 2020-02-27 (IMQ, IE, MVE) |
Conference Information | |
Committee | IE / IMQ / MVE / CQ |
---|---|
Conference Date | 2020/3/5(2days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | Kyushu Institute of Technology |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | Hideaki Kimata(NTT) / Toshiya Nakaguchi(Chiba Univ.) / Kenji Mase(Nagoya Univ.) / Hideyuki Shimonishi(NEC) |
Vice Chair | Kazuya Kodama(NII) / Keita Takahashi(Nagoya Univ.) / Mitsuru Maeda(Canon) / Kenya Uomori(Osaka Univ.) / Masayuki Ihara(NTT) / Jun Okamoto(NTT) / Takefumi Hiraguri(Nippon Inst. of Tech.) |
Secretary | Kazuya Kodama(NTT) / Keita Takahashi(NHK) / Mitsuru Maeda(Shizuoka Univ.) / Kenya Uomori(Sony Semiconductor Solutions) / Masayuki Ihara(Nagoya Univ.) / Jun Okamoto(NTT) / Takefumi Hiraguri(Nippon Inst. of Tech.) |
Assistant | Kyohei Unno(KDDI Research) / Norishige Fukushima(Nagoya Inst. of Tech.) / Hiroaki Kudo(Nagoya Univ.) / Masaru Tsuchida(NTT) / Keita Hirai(Chiba Univ.) / Satoshi Nishiguchi(Oosaka Inst. of Tech.) / Masanori Yokoyama(NTT) / Shogo Fukushima(Univ. of ToKyo) / Chikara Sasaki(KDDI Research) / Yoshiaki Nishikawa(NEC) / Takuto Kimura(NTT) |
Paper Information | |
Registration To | Technical Committee on Image Engineering / Technical Committee on Image Media Quality / Technical Committee on Media Experience and Virtual Environment / Technical Committee on Communication Quality |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | A study on image captioning considering its imageability |
Sub Title (in English) | |
Keyword(1) | Multimedia proccesing |
Keyword(2) | image captioning |
Keyword(3) | psycholinguistics |
Keyword(4) | semantic gap |
1st Author's Name | Kazuki Umemura |
1st Author's Affiliation | Nagoya University(Nagoya Univ.) |
2nd Author's Name | Marc Aurel Kastner |
2nd Author's Affiliation | Nagoya University(Nagoya Univ.) |
3rd Author's Name | Ichiro Ide |
3rd Author's Affiliation | Nagoya University(Nagoya Univ.) |
4th Author's Name | Yasutomo Kawanishi |
4th Author's Affiliation | Nagoya University(Nagoya Univ.) |
5th Author's Name | Takatsugu Hirayama |
5th Author's Affiliation | Nagoya University(Nagoya Univ.) |
6th Author's Name | Keisuke Doman |
6th Author's Affiliation | Chukyo University(Chukyo Univ.) |
7th Author's Name | Daisuke Deguchi |
7th Author's Affiliation | Nagoya University(Nagoya Univ.) |
8th Author's Name | Hiroshi Murase |
8th Author's Affiliation | Nagoya University(Nagoya Univ.) |
Date | 2020-03-05 |
Paper # | IMQ2019-48,IE2019-130,MVE2019-69 |
Volume (vol) | vol.119 |
Number (no) | IMQ-454,IE-456,MVE-457 |
Page | pp.pp.165-169(IMQ), pp.165-169(IE), pp.165-169(MVE), |
#Pages | 5 |
Date of Issue | 2020-02-27 (IMQ, IE, MVE) |