Presentation Abstract / Keywords |
Presentation title |
2010-09-06 10:40
Action planning for interactive visual scene understanding based on knowledge confidence defined on latent spaces Gurbachan Sekhon(Univ. of British Columbia)・○Akisato Kimura・Yasuhiro Minami・Hitoshi Sakano・Eisaku Maeda(NTT) PRMU2010-83 IBISML2010-55 |
Abstract |
(Japanese) |
(Advance-release abstract) This report proposes a method for action planning in interactive visual scene understanding that uses knowledge confidence derived from the latent space of a topic model connecting image features and text labels. To simulate knowledge confidence, we use two sources of information: the position of an input sample relative to the training samples within the latent space, and the overall associativity between text labels as determined by the content of the training samples. |
(English) |
This report proposes a method for action planning in an interactive visual scene understanding system that uses the system's knowledge and its confidence in that knowledge. Knowledge confidence is defined as the combination of two properties on the latent space of a topic model connecting image features and text labels: 1) the similarity between an input sample and the training samples on the latent space, and 2) the overall associability between text labels as determined by the content of the training samples. We evaluate the proposed method in terms of annotation accuracy and the effort users spend providing answers. Experimental results on the PASCAL VOC2008 dataset indicate that the proposed method achieved comparable or better annotation accuracy with less effort than two baseline strategies: 1) always asking for the names of objects and 2) generating random questions. |
Keywords |
(Japanese) |
/ / / / / / / |
(English) |
image annotation / human-computer interaction / action planning / reinforcement learning / topic model / knowledge confidence |
Bibliographic information |
IEICE Tech. Rep., vol. 110, no. 187, PRMU2010-83, pp. 201-208, Sept. 2010. |
Report number |
PRMU2010-83 |
Issue date |
2010-08-29 (PRMU, IBISML) |
ISSN |
Print edition: ISSN 0913-5685 Online edition: ISSN 2432-6380 |
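Copyright |
The copyright of papers published in the Technical Report belongs to the IEICE (The Institute of Electronics, Information and Communication Engineers). (Permission No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034) |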