強化学習における状態空間の自律的構成方法

Presentation	2001/1/4 Autonomous Composition of the State Space in Reinforcement Learning Takemasa SHIBA, Takashi ISHIKAWA,
PDF Download Page	PDF download Page Link
Abstract(in Japanese)	(See Japanese page)
Abstract(in English)	Reinforcement learning acquires the optimum action rules with the agent himself for the remuneration given from the environment.Since it is decided from the state which an agent can recognize, if the action rules for which it is asked has not conformed the optimum state space, it cannot gain the optimum action rules.There is a technique into which the agent itself divides state space autonomously as the method of solving this problem.This paper, describes a method of dividing state space based on the forecasting model of the state change using statistical alignment approximation as the technique of constituting state space autonomously.When environment changes, the improvement for building state space is described.Moreover, the improved technique is applied to the cart and pole problem and the result is shown.
Keyword(in Japanese)	(See Japanese page)
Keyword(in English)	reinforcement learning / state space construction / dynamic environment / multi regression / clustering
Paper #	AI2000-65
Date of Issue

Paper Information
Registration To	Artificial Intelligence and Knowledge-Based Processing (AI)
Language	JPN
Title (in Japanese)	(See Japanese page)
Sub Title (in Japanese)	(See Japanese page)
Title (in English)	Autonomous Composition of the State Space in Reinforcement Learning
Sub Title (in English)
Keyword(1)	reinforcement learning
Keyword(2)	state space construction
Keyword(3)	dynamic environment
Keyword(4)	multi regression
Keyword(5)	clustering
1st Author's Name	Takemasa SHIBA
1st Author's Affiliation	Nippon Institute of Technology()
2nd Author's Name	Takashi ISHIKAWA
2nd Author's Affiliation	Nippon Institute of Technology
Date	2001/1/4
Paper #	AI2000-65
Volume (vol)	vol.100
Number (no)	530
Page	pp.pp.-
#Pages	6
Date of Issue