Presentation | 2021-03-04 VQA for Medical Image Data based on Image Feature Extraction and Fusion Hideo Umada, Masaki Aono, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | In recent years, there has been a remarkable growth in research on deep learning in the fields of computer vision and natural language processing, and there are growing expectations about the application of artificial intelligence in various fields. As a result, there is a growing demand for research on the VQA-Med task, which is an application of Visual QA, a research that requires both computer vision and natural language processing techniques, to the medical field. Medical images include images from various modalities such as X-ray images, MRI images, and CT images. In this study, we consider QA problems as classification problems and propose a method for obtaining effective features for a variety of medical images and a feature synthesis method, Feature Fusion Networks, for learning multimodal relationships between images and questions. Using the VQA-Med2020 dataset, we experimented with and evaluated the system, and reported on the new findings. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | Deep Learning / Visual Question-Answering / Medical Image / Feature Extraction |
Paper # | PRMU2020-81 |
Date of Issue | 2021-02-25 (PRMU) |
Conference Information | |
Committee | PRMU / IPSJ-CVIM |
---|---|
Conference Date | 2021/3/4(2days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | Online |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | Computer Vision and Pattern Recognition for specific environment |
Chair | Yoichi Sato(Univ. of Tokyo) |
Vice Chair | Akisato Kimura(NTT) / Masakazu Iwamura(Osaka Pref. Univ.) |
Secretary | Akisato Kimura(Mobility Technologies) / Masakazu Iwamura(Chubu Univ.) |
Assistant | Takashi Shibata(NTT) / Masashi Nishiyama(Tottori Univ.) |
Paper Information | |
Registration To | Technical Committee on Pattern Recognition and Media Understanding / Special Interest Group on Computer Vision and Image Media |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | VQA for Medical Image Data based on Image Feature Extraction and Fusion |
Sub Title (in English) | |
Keyword(1) | Deep Learning |
Keyword(2) | Visual Question-Answering |
Keyword(3) | Medical Image |
Keyword(4) | Feature Extraction |
1st Author's Name | Hideo Umada |
1st Author's Affiliation | Toyohashi University of Technology(TUT) |
2nd Author's Name | Masaki Aono |
2nd Author's Affiliation | Toyohashi University of Technology(TUT) |
Date | 2021-03-04 |
Paper # | PRMU2020-81 |
Volume (vol) | vol.120 |
Number (no) | PRMU-409 |
Page | pp.pp.71-76(PRMU), |
#Pages | 6 |
Date of Issue | 2021-02-25 (PRMU) |