Combining Multiagent Reinforcement Learning and Discrete Event Modeling for Pathfinding on a Non-Grid Graph

丁 世堯; 青山 秀紀; 林 冬惠

講演名	2021-08-27 Combining Multiagent Reinforcement Learning and Discrete Event Modeling for Pathfinding on a Non-Grid Graph 丁世堯(京大), 青山秀紀(パナソニック), 林冬惠(京大),
PDFダウンロードページ	PDFダウンロードページへ
抄録(和)
抄録(英)	In this report, we study a new multiagent path finding (MAPF) problem where multiple agents move on a non-grid graph with the goal of minimizing the traveling time summation of all agents. Multiagent reinforcement learning (MARL), which is effective to solve the traditional MAPF problems on a grid graph, can be applied in this new problem. However, considering the following two issues brought by the non-grid feature: 1) the action space is large where the agent actions are the nodes it can arrive rather than only four directions (up, down, right, left); 2) the state space is large where the agent can stay at edges rather than only nodes, MARL cannot learn optimal paths for all agents effectively. As for solving this problem, we propose a novel MARL algorithm by importing a discrete event model to MARL. Specifically, one part of agents’ pathfinding are solved by the predefined rules. Then, based on those pathfinding results, the other part of agents are trained by MARL further, which can accelerate the learning process. Finally, the experiment results show the effectiveness of our proposed method than some existing algorithms.
キーワード(和)
キーワード(英)	Multi-agent PathfindingMulti-agent reinforcement learningDrone delivery
資料番号	SWIM2021-15,SC2021-13
発行日	2021-08-20 (SWIM, SC)

研究会情報
研究会	SWIM / SC
開催期間	2021/8/27(から1日開催)
開催地（和）	オンライン開催 (FIT2021との併催)
開催地（英）	Online
テーマ（和）	ソサエティ5.0に向けたサービスコンピューティングとインタプライズモデル化技術,一般
テーマ（英）
委員長氏名（和）	五月女健治(法政大) / 菊地伸治(物質・材料研究機構)
委員長氏名（英）	Kenji Saotome(Hosei Univ.) / Shinji Kikuchi(NIMS)
副委員長氏名（和）	林章浩(静岡理工科大) / 山登庸次(NTT) / 木村功作(富士通研)
副委員長氏名（英）	Akihiro Hayashi(Shizuoka Inst. of Science and Tech.) / Yoji Yamato(NTT) / Kosaku Kimura(Fujitsu Lab.)
幹事氏名（和）	能上慎也(東京理科大) / 山田耕嗣(阪産大) / 中村匡秀(神戸大) / 細野繁(東京工科大)
幹事氏名（英）	Shinya Nogami(Tokyo Univ. of Science) / Koji Yamada(Osaka Sangyo Univ.) / Masahide Nakamura(Kobe Univ.) / Shigeru Hosono(Tokyo Univ. of Tech.)
幹事補佐氏名（和）	工藤司(静岡理工科大) / 辻孝吉(愛知県立大) / 手塚伸(日立) / 中口孝雄(京都情報大学院大)
幹事補佐氏名（英）	Tsukasa Kudo(Shizuoka Inst. of Science and Tech.) / Kokichi Tsuji(Aichi Pref. Univ.) / Shin Tezuka(Hitachi) / Takao Nakaguchi(KCGI)

講演論文情報詳細
申込み研究会	Technical Committee on Software Interprise Modeling / Technical Committee on Service Computing
本文の言語	ENG
タイトル（和）
サブタイトル（和）
タイトル（英）	Combining Multiagent Reinforcement Learning and Discrete Event Modeling for Pathfinding on a Non-Grid Graph
サブタイトル（和）
キーワード(1)（和/英）	/ Multi-agent PathfindingMulti-agent reinforcement learningDrone delivery
第 1 著者氏名（和/英）	丁世堯 / Shiyao Ding
第 1 著者所属（和/英）	京都大学(略称：京大) Kyoto University(略称：Kyoto Univ.)
第 2 著者氏名（和/英）	青山秀紀 / Hideki Aoyama
第 2 著者所属（和/英）	パナソニック株式会社(略称：パナソニック) Panasonic Corporation(略称：Panasonic)
第 3 著者氏名（和/英）	林冬惠 / Donghui Lin
第 3 著者所属（和/英）	京都大学(略称：京大) Kyoto University(略称：Kyoto Univ.)
発表年月日	2021-08-27
資料番号	SWIM2021-15,SC2021-13
巻番号（vol）	vol.121
号番号（no）	SWIM-156,SC-157
ページ範囲	pp.13-17(SWIM), pp.13-17(SC),
ページ数	5
発行日	2021-08-20 (SWIM, SC)