再帰的な選択的注意モデルにおける注意領域サイズの強化学習

村田 悠太朗; 寺前 順之介; 若宮 直紀

講演名	2017-06-24 再帰的な選択的注意モデルにおける注意領域サイズの強化学習村田悠太朗(阪大), 寺前順之介(阪大), 若宮直紀(阪大),
PDFダウンロードページ	PDFダウンロードページへ
抄録(和)	画像の一部を選択的に抽出して画像認識の精度を向上させる選択的注意の研究が近年盛んに行われている．再帰的ニューラルネットワークを用いて選択的注意を実現した先行研究では，抽出する画像領域（注意領域）を強化学習によって学習するが，注意領域サイズが固定されているため，認識できる物体の画像内での大きさが限定されてしまうという問題があった．この問題を解決するため，本研究では，注意領域サイズを可変にする選択的注意モデルを提案する．注意領域の位置のみを選択する先行研究では，注意領域位置のランダムサンプリングを通じて適切な注意領域位置を学習するが，注意領域の位置とサイズでは，サンプリングされる解空間の特性が異なるため，単純なサンプリングでは学習に失敗する．そこで本研究では注意領域サイズが持つ特性を考慮してサンプリングと強化学習手法の改良を行った．提案手法の妥当性を示すために，画像中に提示される文字サイズと選択された注意領域サイズの関係を計測し，さらにサイズの異なる複数の手書き数字のクラス分類を行った．
抄録(英)	Humans percept visual scenes with focusing only on significant parts of visual scenes sequentially ratherprocess whole of the visual stimuli entirely at once. Sequential visual attention attracts much attention recently in thefield of computer vision since it may largely reduce computational cost of image processing. A previous study proposedthe hard attention model, in which neural networks are trained to select suitable attended areas of visual fields forvisual recognition by using reinforcement learning. The model, however, cannot properly select attended area if sizesof significant areas in visual scenes are different from that of training data since it cannot modulate size of attendedareas. To solve the problem, here, we propose a network model that can learn both position and size of attended areaof visual field. Since phase space features are significantly different between position and size of a selected visual scene, they contribute to reinforcement signals in different manner, which causes undesirable bias and failure of learning. Toreduce the bias, we propose a new learning algorithm that almost keep balance of their contribution. We show thatour model can learn to suitably modulate the size of attended area and visual recognition with sequential attention.
キーワード(和)	強化学習 / 選択的注意 / 画像認識 / 再帰ニューラルネットワーク
キーワード(英)	reinforcement learning / visual attention / image recognition / recurrent neural network
資料番号	NC2017-18
発行日	2017-06-16 (NC)

研究会情報
研究会	NC / IPSJ-BIO / IBISML / IPSJ-MPS
開催期間	2017/6/23(から3日開催)
開催地（和）	沖縄科学技術大学院大学
開催地（英）	Okinawa Institute of Science and Technology
テーマ（和）	機械学習によるバイオデータマインニング、一般
テーマ（英）	Machine Learning Approach to Biodata Mining, and General
委員長氏名（和）	萩原将文(慶大) / / 福水健次(統計数理研)
委員長氏名（英）	Masafumi Hagiwara(Keio Univ.) / / Kenji Fukumizu(ISM)
副委員長氏名（和）	平田豊(中部大) / / 杉山将(東大)
副委員長氏名（英）	Yutaka Hirata(Chubu Univ.) / / Masashi Sugiyama(Univ. of Tokyo)
幹事氏名（和）	青西亨(東工大) / 吉川大弘(名大) / / 鹿島久嗣(京大) / 津田宏治(東大)
幹事氏名（英）	Toru Aonishi(Tokyo Inst. of Tech.) / Tomohiro Yoshikawa(Nagoya Univ.) / / Hisashi Kashima(Kyoto Univ.) / Koji Tsuda(Univ. of Tokyo)
幹事補佐氏名（和）	篠沢佳久(慶大) / 稲垣圭一郎(中部大) / / 竹内一郎(名工大) / 神嶌敏弘(産総研)
幹事補佐氏名（英）	Yoshihisa Shinozawa(Keio Univ.) / Keiichiro Inagaki(Chubu Univ.) / / Ichiro Takeuchi(Nagoya Inst. of Tech.) / Toshihiro Kamishima(AIST)

講演論文情報詳細
申込み研究会	Technical Committee on Neurocomputing / Special Interest Group on Bioinformatics and Genomics / Technical Committee on Infomation-Based Induction Sciences and Machine Learning / Special Interest Group on Mathematical Modeling and Problem Solving
本文の言語	JPN
タイトル（和）	再帰的な選択的注意モデルにおける注意領域サイズの強化学習
サブタイトル（和）
タイトル（英）	Reinforcement learning for visual attention with scalable size of attentional field
サブタイトル（和）
キーワード(1)（和/英）	強化学習 / reinforcement learning
キーワード(2)（和/英）	選択的注意 / visual attention
キーワード(3)（和/英）	画像認識 / image recognition
キーワード(4)（和/英）	再帰ニューラルネットワーク / recurrent neural network
第 1 著者氏名（和/英）	村田悠太朗 / Yutaro Murata
第 1 著者所属（和/英）	大阪大学(略称：阪大) Osaka University(略称：Osaka Univ.)
第 2 著者氏名（和/英）	寺前順之介 / Jun-nosuke Teramae
第 2 著者所属（和/英）	大阪大学(略称：阪大) Osaka University(略称：Osaka Univ.)
第 3 著者氏名（和/英）	若宮直紀 / Naoki Wakamiya
第 3 著者所属（和/英）	大阪大学(略称：阪大) Osaka University(略称：Osaka Univ.)
発表年月日	2017-06-24
資料番号	NC2017-18
巻番号（vol）	vol.117
号番号（no）	NC-109
ページ範囲	pp.121-125(NC),
ページ数	5
発行日	2017-06-16 (NC)