Profit Sharing 強化学習法における動的強化関数の設計法

講演名	2007-03-06 Profit Sharing 強化学習法における動的強化関数の設計法中野秀洋, 宮内新,
PDFダウンロードページ	PDFダウンロードページへ
抄録(和)	本稿では,Profit Sharing強化学習法において用いられる強化関数の設計法について検討する.様々な静的または動的な強化関数に対して,学習の合理性と学習効率に関しての理論的な考察を行う.また,本稿で考察する強化関数の枠組みの中で,従来法よりも簡素な動的強化関数を用いた学習法の提案も行う.いくつかの例題タスクを用いた数値実験を行い,各手法における学習性能を比較する.
抄録(英)	This paper studies design of reinforcement functions in the profit sharing reinforcement learning. We gives some theoretical consideration of the learning rationality and effectivity for various static and dynamic reinforcement functions. Also, we propose a learning method using a simpler dynamic reinforcement function compared with the conventional method. This method includes the reinforcement function considered in this paper. We perform numerical experiments for some example tasks, and compare learning performances of each method.
キーワード(和)	強化学習 / 強化関数 / 経験強化型 / Profit sharing
キーワード(英)	Reinforcement learning / reinforcement function / exploitation-oriented / profit sharing
資料番号	NLP2006-154
発行日

講演論文情報詳細
申込み研究会	Nonlinear Problems (NLP)
本文の言語	JPN
タイトル（和）	Profit Sharing 強化学習法における動的強化関数の設計法
サブタイトル（和）
タイトル（英）	Design of Reinforcement Functions in Profit Sharing Reinforcement Learning
サブタイトル（和）
キーワード(1)（和/英）	強化学習 / Reinforcement learning
キーワード(2)（和/英）	強化関数 / reinforcement function
キーワード(3)（和/英）	経験強化型 / exploitation-oriented
キーワード(4)（和/英）	Profit sharing / profit sharing
第 1 著者氏名（和/英）	中野秀洋 / Hidehiro NAKANO
第 1 著者所属（和/英）	武蔵工業大学工学部 Faculty of Engineering, Musashi Institute of Technology
第 2 著者氏名（和/英）	宮内新 / Arata MIYAUCHI
第 2 著者所属（和/英）	武蔵工業大学工学部 Faculty of Engineering, Musashi Institute of Technology
発表年月日	2007-03-06
資料番号	NLP2006-154
巻番号（vol）	vol.106
号番号（no）	574
ページ範囲	pp.-
ページ数	6
発行日