Presentation | 2001/1/4 Autonomous Composition of the State Space in Reinforcement Learning Takemasa SHIBA, Takashi ISHIKAWA, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | Reinforcement learning acquires the optimum action rules with the agent himself for the remuneration given from the environment.Since it is decided from the state which an agent can recognize, if the action rules for which it is asked has not conformed the optimum state space, it cannot gain the optimum action rules.There is a technique into which the agent itself divides state space autonomously as the method of solving this problem.This paper, describes a method of dividing state space based on the forecasting model of the state change using statistical alignment approximation as the technique of constituting state space autonomously.When environment changes, the improvement for building state space is described.Moreover, the improved technique is applied to the cart and pole problem and the result is shown. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | reinforcement learning / state space construction / dynamic environment / multi regression / clustering |
Paper # | AI2000-65 |
Date of Issue |
Conference Information | |
Committee | AI |
---|---|
Conference Date | 2001/1/4(1days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Artificial Intelligence and Knowledge-Based Processing (AI) |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Autonomous Composition of the State Space in Reinforcement Learning |
Sub Title (in English) | |
Keyword(1) | reinforcement learning |
Keyword(2) | state space construction |
Keyword(3) | dynamic environment |
Keyword(4) | multi regression |
Keyword(5) | clustering |
1st Author's Name | Takemasa SHIBA |
1st Author's Affiliation | Nippon Institute of Technology() |
2nd Author's Name | Takashi ISHIKAWA |
2nd Author's Affiliation | Nippon Institute of Technology |
Date | 2001/1/4 |
Paper # | AI2000-65 |
Volume (vol) | vol.100 |
Number (no) | 530 |
Page | pp.pp.- |
#Pages | 6 |
Date of Issue |