強化学習に基づく操船経路の決定について

講演名	2002/3/8 強化学習に基づく操船経路の決定について三堀邦彦, 神尾武司,
PDFダウンロードページ	PDFダウンロードページへ
抄録(和)	近年、人工知能や制御理論の分野で強化学習の概念が注目されている。本論文は、強化学習に基づく最短操船経路の決定について議論する。操船運動は、4次元の非線形な微分方程式で記述される。4つの状態変数の離散化を通して、基本的な強化学習法であるQ-Leamingアルゴリズムをこの運動方程式に適用する。簡単な例題においてアルゴリズムの有効性が確認される。
抄録(英)	Recently, reinforcement learning has been attracting attentions of researcher in the fields of artificial intelligence and control theory. This paper discusses the shortest path planning in maneuvering motion of a ship based on reinforcement learning. The maneuvering motion of a ship is described by four dimensional nonlinear equations. Q-Learning algorithm, which is a basic algorithm in reinforcement learning, is applied to this equations with the discretization of their four state variables. The algorithm performance is demonstrated in a simple example.
キーワード(和)	操船運動方程式 / 最短経路決定問題 / 強化学習 / Q-Leaming
キーワード(英)	Ship-maneuvering motion equations / Short path problem / Reinforcement learning / Q-Learning
資料番号	NLP2001-111
発行日

講演論文情報詳細
申込み研究会	Nonlinear Problems (NLP)
本文の言語	JPN
タイトル（和）	強化学習に基づく操船経路の決定について
サブタイトル（和）
タイトル（英）	On planning of maneuvering motion of a ship based on reinforcement learning
サブタイトル（和）
キーワード(1)（和/英）	操船運動方程式 / Ship-maneuvering motion equations
キーワード(2)（和/英）	最短経路決定問題 / Short path problem
キーワード(3)（和/英）	強化学習 / Reinforcement learning
キーワード(4)（和/英）	Q-Leaming / Q-Learning
第 1 著者氏名（和/英）	三堀邦彦 / Kunihiko MITSUBORI
第 1 著者所属（和/英）	海上保安大学校通信基礎工学講座 Japan Coast Guard Academy
第 2 著者氏名（和/英）	神尾武司 / Takeshi KAMIO
第 2 著者所属（和/英）	広島市立大学情報科学部情報機械システム工学科 Hiroshima City University
発表年月日	2002/3/8
資料番号	NLP2001-111
巻番号（vol）	vol.101
号番号（no）	723
ページ範囲	pp.-
ページ数	8
発行日