Reinforcement Learning and Goal Estimation by Multiple Forward and Reward Models

Presentation	2002/1/22 Reinforcement Learning and Goal Estimation by Multiple Forward and Reward Models Norikazu SUGIMOTO, Kazuyuki SAMEJIMA, Kenji DOYA, Mitsuo KAWATO
Abstract(in Japanese)	(See Japanese page)
Abstract(in English)	This research presents a new reinforcement learning framework, "Combinatorial Model-based Reinforcement Learning (CMRL)", which flexibly combines forward models, reward models, and controllers. First, appropriate forward models and reward models are selected based on the correctness of their predictions. Then an appropriate controller is selected based on the TD-error given by the models and the controllers. A similar module selection method can be applied to imitation learning that takes into account the difference in the parameters of the learner and the teacher.
Keyword(in Japanese)	(See Japanese page)
Keyword(in English)	module learning / reinforcement learning / nonlinear control / imitation learning
Paper #
Date of Issue
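
The two-stage selection described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the record does not give the selection rule, so this sketch assumes a softmax over negative squared prediction errors for choosing a forward or reward model, and assumes the controller with the smallest-magnitude TD error is chosen. The function names and the parameters `sigma` and `gamma` are hypothetical.

```python
import math


def responsibilities(errors, sigma=1.0):
    """Softmax weights over modules (assumed rule): a smaller squared
    prediction error gives a larger weight."""
    scores = [-(e * e) / (2.0 * sigma * sigma) for e in errors]
    top = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [x / total for x in exps]


def select_module(errors, sigma=1.0):
    """Index of the forward or reward model with the highest weight."""
    weights = responsibilities(errors, sigma)
    return max(range(len(weights)), key=weights.__getitem__)


def td_error(reward, value_now, value_next, gamma=0.9):
    """One-step TD error: r + gamma * V(s') - V(s)."""
    return reward + gamma * value_next - value_now


def select_controller(td_errors):
    """Index of the controller whose TD error is smallest in magnitude
    (assumed criterion, not taken from the paper)."""
    return min(range(len(td_errors)), key=lambda i: abs(td_errors[i]))
```

For example, `select_module([0.1, 2.0, 1.0])` picks the first model because its prediction error is lowest, and `select_controller([0.8, -0.1, 0.5])` picks the second controller.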

Paper Information
Registration To	Neurocomputing (NC)
Language	JPN
Title (in Japanese)	(See Japanese page)
Sub Title (in Japanese)	(See Japanese page)
Title (in English)	Reinforcement Learning and Goal Estimation by Multiple Forward and Reward Models
Sub Title (in English)
Keyword(1)	module learning
Keyword(2)	reinforcement learning
Keyword(3)	nonlinear control
Keyword(4)	imitation learning
1st Author's Name	Norikazu SUGIMOTO
1st Author's Affiliation	Nara Institute of Science and Technology:ATR, Human Information Science Laboratories, Department 3:Creating the Brain, CREST, Japan Science and Technology Corporation
2nd Author's Name	Kazuyuki SAMEJIMA
2nd Author's Affiliation	ATR, Human Information Science Laboratories, Department 3:Creating the Brain, CREST, Japan Science and Technology Corporation
3rd Author's Name	Kenji DOYA
3rd Author's Affiliation	Nara Institute of Science and Technology:ATR, Human Information Science Laboratories, Department 3:Creating the Brain, CREST, Japan Science and Technology Corporation
4th Author's Name	Mitsuo KAWATO
4th Author's Affiliation	Nara Institute of Science and Technology:ATR, Human Information Science Laboratories, Department 3
Date	2002/1/22
Volume (vol)	vol.101
Number (no)	616
Page	pp.-
#Pages	8