Presentation | 2006-06-16 Feature Extraction for Decision-Theoretic Planning in Partially Observable Stochastic Domains Hajime FUJITA, Yutaka NAKAMURA, Shin ISHII, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | We propose a feature extraction technique for decision-theoretic planning problems in partially observable stochastic domains and show a novel approach for solving them. To maximize an expected future reward in unknown environment, all the agent has to do is to estimate a Markov chain over a statistic variable related to rewards. In our approach, an auxiliary state variable whose stochastic process satisfies the Markov property, called internal state, is introduced to the model with the assumption that the rewards are dependent on a pair of an internal state and an action. The agent then estimates the dynamics of an internal state model based on the maximum likelihood inference along with acquiring its policy; the internal state model represents an essential feature necessary to decision-making. Computer simulation results show that our technique can find an appropriate feature for acquiring a good policy and achieve faster learning with fewer policy parameters than a conventional algorithm, in a reasonably sized partially observable problem. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | Partially observable environments / Decision-theoretic planning / Internal state / Maximum likelihood inference / Feature extraction |
Paper # | NC2006-24 |
Date of Issue |
Conference Information | |
Committee | NC |
---|---|
Conference Date | 2006/6/9(1days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Neurocomputing (NC) |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Feature Extraction for Decision-Theoretic Planning in Partially Observable Stochastic Domains |
Sub Title (in English) | |
Keyword(1) | Partially observable environments |
Keyword(2) | Decision-theoretic planning |
Keyword(3) | Internal state |
Keyword(4) | Maximum likelihood inference |
Keyword(5) | Feature extraction |
1st Author's Name | Hajime FUJITA |
1st Author's Affiliation | Nara Institute of Science and Technology() |
2nd Author's Name | Yutaka NAKAMURA |
2nd Author's Affiliation | Nara Institute of Science and Technology |
3rd Author's Name | Shin ISHII |
3rd Author's Affiliation | Nara Institute of Science and Technology |
Date | 2006-06-16 |
Paper # | NC2006-24 |
Volume (vol) | vol.106 |
Number (no) | 102 |
Page | pp.pp.- |
#Pages | 6 |
Date of Issue |