An efficient reinforcement learning method for dynamic environments using short term adjustment

Hidehiro Nakano; Satoko Takada; Shuichi Arai; Arata Miyauchi

Summary

International Symposium on Nonlinear Theory and its Applications

2005

Session Number:1-3-3

Session:

Number:1-3-3-2

An efficient reinforcement learning method for dynamic environments using short term adjustment

Hidehiro Nakano, Satoko Takada, Shuichi Arai, Arata Miyauchi,

pp.250-253

Publication Date:2005/10/18

Online ISSN:2188-5079

DOI:10.34385/proc.40.1-3-3-2

PDF download (160KB)

Summary:

This paper proposes a novel reinforcement learning method for dynamic environments. A learning agent estimates changing environments by comparing rule sequence with each action selection probability. If the change is estimated, action selection probabilities are temporarily adjusted. We derive the condition for the amount of adjustment to be flexibly adaptive for dynamic environments. Our method provides better learning performances in various dynamic environments than conventional methods. We present some numerical results for our method applied to dynamic maze problems.