Presentation 2001/1/4
Autonomous Composition of the State Space in Reinforcement Learning
Takemasa SHIBA, Takashi ISHIKAWA,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) Reinforcement learning acquires the optimum action rules with the agent himself for the remuneration given from the environment.Since it is decided from the state which an agent can recognize, if the action rules for which it is asked has not conformed the optimum state space, it cannot gain the optimum action rules.There is a technique into which the agent itself divides state space autonomously as the method of solving this problem.This paper, describes a method of dividing state space based on the forecasting model of the state change using statistical alignment approximation as the technique of constituting state space autonomously.When environment changes, the improvement for building state space is described.Moreover, the improved technique is applied to the cart and pole problem and the result is shown.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) reinforcement learning / state space construction / dynamic environment / multi regression / clustering
Paper # AI2000-65
Date of Issue

Conference Information
Committee AI
Conference Date 2001/1/4(1days)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Artificial Intelligence and Knowledge-Based Processing (AI)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Autonomous Composition of the State Space in Reinforcement Learning
Sub Title (in English)
Keyword(1) reinforcement learning
Keyword(2) state space construction
Keyword(3) dynamic environment
Keyword(4) multi regression
Keyword(5) clustering
1st Author's Name Takemasa SHIBA
1st Author's Affiliation Nippon Institute of Technology()
2nd Author's Name Takashi ISHIKAWA
2nd Author's Affiliation Nippon Institute of Technology
Date 2001/1/4
Paper # AI2000-65
Volume (vol) vol.100
Number (no) 530
Page pp.pp.-
#Pages 6
Date of Issue