Presentation 2018-03-09
Efficient and Interactive Image Retrieval Based on Semantic Segmentation
Ryosuke Furuta, Naoto Inoue, Toshihiko Yamasaki,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) This paper proposes an efficient image retrieval system. When users wish to retrieve images with semantic and spatial constraints (eg, a horse is located at the center of the image, and a person is riding on the horse), it is difficult for conventional text-based retrieval systems to retrieve such images exactly. In contrast, the proposed system can consider both semantic and spatial information, because it is based on semantic segmentation using fully convolutional networks (FCN). The proposed system can accept three types of images as queries: a segmentation map sketched by the user, a natural image, or a combination of the two. The distance between the query and each image in the database is calculated based on the output probability maps from the FCN. In order to make the system efficient in terms of both the computation time and memory usage, we employ the product quantization technique (PQ). The experimental results show that the PQ is compatible with the FCN-based image retrieval system, and that the quantization process results in little information loss. It is also shown that our method outperforms a conventional text-based search system.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) Image retrieval / Fully convolutional networks / Semantic segmentation / Product quantization
Paper # IMQ2017-59,IE2017-151,MVE2017-101
Date of Issue 2018-03-01 (IMQ, IE, MVE)

Conference Information
Committee CQ / MVE / IE / IMQ
Conference Date 2018/3/8(2days)
Place (in Japanese) (See Japanese page)
Place (in English) Okinawa Industry Support Center
Topics (in Japanese) (See Japanese page)
Topics (in English) Five Senses Media, Cooking and Eating Activities Media, Multimedia, Media Experience, Video Encoding, Image Media Quality, Network Quality and Reliability, etc. (Co-sponsor: Technical Committee on Multimedia on Cooking and Eating Activities (CEA))
Chair Takanori Hayashi(Hiroshima Inst. of Tech.) / Yoshinari Kameda(Univ. of Tsukuba) / Takayuki Hamamoto(Tokyo Univ. of Science) / Kenji Sugiyama(Seikei Univ.)
Vice Chair Hideyuki Shimonishi(NEC) / Jun Okamoto(NTT) / Kenji Mase(Nagoya Univ.) / Kazuya Kodama(NII) / Hideaki Kimata(NTT) / Toshiya Nakaguchi(Chiba Univ.) / Mitsuru Maeda(Canon)
Secretary Hideyuki Shimonishi(NTT) / Jun Okamoto(Keio Univ.) / Kenji Mase(Kyoto Univ.) / Kazuya Kodama(NTT) / Hideaki Kimata(Kyushu Univ.) / Toshiya Nakaguchi(Nagoya Univ.) / Mitsuru Maeda(KDDI Research)
Assistant Kenko Ota(Nippon Inst. of Tech.) / Norihiro Fukumoto(KDDI Research Inc.) / Ryo Yamamoto(UEC) / Takatsugu Hirayama(Nagoya Univ.) / Ryosuke Aoki(NTT) / Yasutaka Matsuo(NHK) / Kazuya Hayase(NTT) / Masaru Tsuchida(NTT) / Gosuke Ohashi(Shizuoka Univ.)

Paper Information
Registration To Technical Committee on Communication Quality / Technical Committee on Media Experience and Virtual Environment / Technical Committee on Image Engineering / Technical Committee on Image Media Quality
Language ENG-JTITLE
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Efficient and Interactive Image Retrieval Based on Semantic Segmentation
Sub Title (in English)
Keyword(1) Image retrieval
Keyword(2) Fully convolutional networks
Keyword(3) Semantic segmentation
Keyword(4) Product quantization
1st Author's Name Ryosuke Furuta
1st Author's Affiliation The University of Tokyo(Univ. of Tokyo)
2nd Author's Name Naoto Inoue
2nd Author's Affiliation The University of Tokyo(Univ. of Tokyo)
3rd Author's Name Toshihiko Yamasaki
3rd Author's Affiliation The University of Tokyo(Univ. of Tokyo)
Date 2018-03-09
Paper # IMQ2017-59,IE2017-151,MVE2017-101
Volume (vol) vol.117
Number (no) IMQ-483,IE-484,MVE-485
Page pp.pp.189-194(IMQ), pp.189-194(IE), pp.189-194(MVE),
#Pages 6
Date of Issue 2018-03-01 (IMQ, IE, MVE)