Presentation 2021-03-04
VQA for Medical Image Data based on Image Feature Extraction and Fusion
Hideo Umada, Masaki Aono,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) In recent years, there has been a remarkable growth in research on deep learning in the fields of computer vision and natural language processing, and there are growing expectations about the application of artificial intelligence in various fields. As a result, there is a growing demand for research on the VQA-Med task, which is an application of Visual QA, a research that requires both computer vision and natural language processing techniques, to the medical field. Medical images include images from various modalities such as X-ray images, MRI images, and CT images. In this study, we consider QA problems as classification problems and propose a method for obtaining effective features for a variety of medical images and a feature synthesis method, Feature Fusion Networks, for learning multimodal relationships between images and questions. Using the VQA-Med2020 dataset, we experimented with and evaluated the system, and reported on the new findings.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) Deep Learning / Visual Question-Answering / Medical Image / Feature Extraction
Paper # PRMU2020-81
Date of Issue 2021-02-25 (PRMU)

Conference Information
Committee PRMU / IPSJ-CVIM
Conference Date 2021/3/4(2days)
Place (in Japanese) (See Japanese page)
Place (in English) Online
Topics (in Japanese) (See Japanese page)
Topics (in English) Computer Vision and Pattern Recognition for specific environment
Chair Yoichi Sato(Univ. of Tokyo)
Vice Chair Akisato Kimura(NTT) / Masakazu Iwamura(Osaka Pref. Univ.)
Secretary Akisato Kimura(Mobility Technologies) / Masakazu Iwamura(Chubu Univ.)
Assistant Takashi Shibata(NTT) / Masashi Nishiyama(Tottori Univ.)

Paper Information
Registration To Technical Committee on Pattern Recognition and Media Understanding / Special Interest Group on Computer Vision and Image Media
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) VQA for Medical Image Data based on Image Feature Extraction and Fusion
Sub Title (in English)
Keyword(1) Deep Learning
Keyword(2) Visual Question-Answering
Keyword(3) Medical Image
Keyword(4) Feature Extraction
1st Author's Name Hideo Umada
1st Author's Affiliation Toyohashi University of Technology(TUT)
2nd Author's Name Masaki Aono
2nd Author's Affiliation Toyohashi University of Technology(TUT)
Date 2021-03-04
Paper # PRMU2020-81
Volume (vol) vol.120
Number (no) PRMU-409
Page pp.pp.71-76(PRMU),
#Pages 6
Date of Issue 2021-02-25 (PRMU)