強化学習を用いた交渉戦略学習アルゴリズム

講演名	2001/7/16 強化学習を用いた交渉戦略学習アルゴリズム大竹麗央, 西田豊明,
PDFダウンロードページ	PDFダウンロードページへ
抄録(和)	本研究では、インターネットオークション上で、ユーザの代わりに自動入札を行うエージェントが、周囲の状況に応じた入札戦略を獲得していくアルゴリズムを提案する。アルゴリズムの根幹である学習機構および意思決定機構には、強化学習の代表的手法であるQ学習とe-greedy選択を用いる。せり上げオークションを例にシミュレーションを行なったところ、落札1回あたりの効用を大きくするための戦略を獲得することができた。
抄録(英)	In this paper, we propose an algorithm for self-bidding agents in the internet auction to learn bidding strategy depending on the circumstances. Q-learning and e-greedy methods, the typical techniques in reinforcement learning, are used in learning and decision-making modules which are the basis of this algorithm. We applied this algorithm to an ascending-bid auction, and the agents could acquire the bidding strategy to get higher utility in one successful bid.
キーワード(和)	オークション / 入札エージェント / 入札戦略 / 強化学習 / 効用
キーワード(英)	auction / self-bidding agent / bidding strategy / reinforcement learning / utility
資料番号	OFS2001-11,AI2001-16
発行日

講演論文情報詳細
申込み研究会	Artificial Intelligence and Knowledge-Based Processing (AI)
本文の言語	JPN
タイトル（和）	強化学習を用いた交渉戦略学習アルゴリズム
サブタイトル（和）
タイトル（英）	Algorithm for Learning Negotiation Strategy with Reinforcement Learning
サブタイトル（和）
キーワード(1)（和/英）	オークション / auction
キーワード(2)（和/英）	入札エージェント / self-bidding agent
キーワード(3)（和/英）	入札戦略 / bidding strategy
キーワード(4)（和/英）	強化学習 / reinforcement learning
キーワード(5)（和/英）	効用 / utility
第 1 著者氏名（和/英）	大竹麗央 / Leo OHTAKE
第 1 著者所属（和/英）	東京大学大学院情報理工学系研究科電子情報学専攻 Department of Information and Communication Engineering, Graduate school of Information Technology, The University of Tokyo.
第 2 著者氏名（和/英）	西田豊明 / Toyoaki NISHIDA
第 2 著者所属（和/英）	東京大学大学院情報理工学系研究科電子情報学専攻 Department of Information and Communication Engineering, Graduate school of Information Technology, The University of Tokyo.
発表年月日	2001/7/16
資料番号	OFS2001-11,AI2001-16
巻番号（vol）	vol.101
号番号（no）	210
ページ範囲	pp.-
ページ数	8
発行日