Presentation | 2023-09-21 BoxPlotQA: Visual Question Answering for Measuring Five-Number Summary and Comparison Performance with Box Plot Yusuke Tozaki, Hisashi Miyamori, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | Recently, visual question and answer (VQA) research on document and chart images, as well as natural images, has attracted much attention.In particular, there have been many studies on chart images that visualize quantities or proportions, such as bar charts, pie charts, and line graphs.However, chart images that visualize data variability, such as histograms and box plots (box-and-whisker), have not received much attention.In this paper, we propose a VQA task BoxPlotQA for box plot images and construct a new benchmark dataset for the task.Box plot images differ from other chart image VQAs in that they allow us to measure the ability to accurately read visual features such as value, width, and symmetry, and to compare these features across multiple data sets.In our experiments, we will test the effect on performance of several baseline models with and without training on the BoxPlotQA dataset, as well as the performance of different question types on real-world observed data.This study is expected to facilitate the analysis and improvement of the ability of visual language models to accurately read chart images. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | Box Plot / Chart Question Answering / Visual Question Answering / Vision Language Model |
Paper # | DE2023-16 |
Date of Issue | 2023-09-14 (DE) |
Conference Information | |
Committee | DE / IPSJ-DBS / IPSJ-IFAT |
---|---|
Conference Date | 2023/9/21(2days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | Kitakyushu International Conference Center |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | Bigdata management, information retrieval, knowledge discovery, etc. |
Chair | Masashi Toyoda(Univ. of Tokyo) |
Vice Chair | Kosuke Takano(Kanagawa Inst. of Tech.) / Chiemi Watanabe(Tsukuba Univ. of Technology) |
Secretary | Kosuke Takano(Univ. of Tsukuba) / Chiemi Watanabe(Komazawa Univ.) |
Assistant | Takahiro Komamizu(Nagoya Univ.) |
Paper Information | |
Registration To | Technical Committee on Data Engineering / Special Interest Group on Database System / Special Interest Group on Information Fundamentals and Access Technologies |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | BoxPlotQA: Visual Question Answering for Measuring Five-Number Summary and Comparison Performance with Box Plot |
Sub Title (in English) | |
Keyword(1) | Box Plot |
Keyword(2) | Chart Question Answering |
Keyword(3) | Visual Question Answering |
Keyword(4) | Vision Language Model |
1st Author's Name | Yusuke Tozaki |
1st Author's Affiliation | Kyoto Sangyo University(Kyoto Sangyo Univ.) |
2nd Author's Name | Hisashi Miyamori |
2nd Author's Affiliation | Kyoto Sangyo University(Kyoto Sangyo Univ.) |
Date | 2023-09-21 |
Paper # | DE2023-16 |
Volume (vol) | vol.123 |
Number (no) | DE-192 |
Page | pp.pp.31-36(DE), |
#Pages | 6 |
Date of Issue | 2023-09-14 (DE) |