Composition of Feature Space and State Space Dynamics Models for Model-based Reinforcement Learning

講演名	2009-07-13 Composition of Feature Space and State Space Dynamics Models for Model-based Reinforcement Learning ,
PDFダウンロードページ	PDFダウンロードページへ
抄録(和)
抄録(英)	Learning a dynamics model and a reward model during reinforcement learning is a useful way, since the agent can also update its value function by using the models. In this paper, we propose a general dynamics model that is a composition of the feature space dynamics model and the state space dynamics model. This way enables to obtain a good generalization from a small number of samples because of the linearity of the state space dynamics, while it does not lose the accuracy. We demonstrate the simulation comparison of some dynamics models used together with a Dyna algorithm.
キーワード(和)
キーワード(英)	Model-based reinforcement learning / Dyna-style planning / prioritized sweeping / dynamics model
資料番号	NLP2009-15,NC2009-8
発行日

講演論文情報詳細
申込み研究会	Neurocomputing (NC)
本文の言語	ENG
タイトル（和）
サブタイトル（和）
タイトル（英）	Composition of Feature Space and State Space Dynamics Models for Model-based Reinforcement Learning
サブタイトル（和）
キーワード(1)（和/英）	/ Model-based reinforcement learning
第 1 著者氏名（和/英）	/ Akihiko YAMAGUCHI
第 1 著者所属（和/英）	Graduate School of Information Science, Nara Institute of Science and Technology
発表年月日	2009-07-13
資料番号	NLP2009-15,NC2009-8
巻番号（vol）	vol.109
号番号（no）	125
ページ範囲	pp.-
ページ数	6
発行日