Combining Multiagent Reinforcement Learning and Discrete Event Modeling for Pathfinding on a Non-Grid Graph

丁 世堯; 青山 秀紀; 林 冬惠

Presentation	2021-08-27 Combining Multiagent Reinforcement Learning and Discrete Event Modeling for Pathfinding on a Non-Grid Graph Shiyao Ding, Hideki Aoyama, Donghui Lin,
PDF Download Page	PDF download Page Link
Abstract(in Japanese)	(See Japanese page)
Abstract(in English)	In this report, we study a new multiagent path finding (MAPF) problem where multiple agents move on a non-grid graph with the goal of minimizing the traveling time summation of all agents. Multiagent reinforcement learning (MARL), which is effective to solve the traditional MAPF problems on a grid graph, can be applied in this new problem. However, considering the following two issues brought by the non-grid feature: 1) the action space is large where the agent actions are the nodes it can arrive rather than only four directions (up, down, right, left); 2) the state space is large where the agent can stay at edges rather than only nodes, MARL cannot learn optimal paths for all agents effectively. As for solving this problem, we propose a novel MARL algorithm by importing a discrete event model to MARL. Specifically, one part of agents’ pathfinding are solved by the predefined rules. Then, based on those pathfinding results, the other part of agents are trained by MARL further, which can accelerate the learning process. Finally, the experiment results show the effectiveness of our proposed method than some existing algorithms.
Keyword(in Japanese)	(See Japanese page)
Keyword(in English)	Multi-agent PathfindingMulti-agent reinforcement learningDrone delivery
Paper #	SWIM2021-15,SC2021-13
Date of Issue	2021-08-20 (SWIM, SC)

Conference Information
Committee	SWIM / SC
Conference Date	2021/8/27(1days)
Place (in Japanese)	(See Japanese page)
Place (in English)	Online
Topics (in Japanese)	(See Japanese page)
Topics (in English)
Chair	Kenji Saotome(Hosei Univ.) / Shinji Kikuchi(NIMS)
Vice Chair	Akihiro Hayashi(Shizuoka Inst. of Science and Tech.) / Yoji Yamato(NTT) / Kosaku Kimura(Fujitsu Lab.)
Secretary	Akihiro Hayashi(Tokyo Univ. of Science) / Yoji Yamato(Osaka Sangyo Univ.) / Kosaku Kimura(Kobe Univ.)
Assistant	Tsukasa Kudo(Shizuoka Inst. of Science and Tech.) / Kokichi Tsuji(Aichi Pref. Univ.) / Shin Tezuka(Hitachi) / Takao Nakaguchi(KCGI)

Paper Information
Registration To	Technical Committee on Software Interprise Modeling / Technical Committee on Service Computing
Language	ENG
Title (in Japanese)	(See Japanese page)
Sub Title (in Japanese)	(See Japanese page)
Title (in English)	Combining Multiagent Reinforcement Learning and Discrete Event Modeling for Pathfinding on a Non-Grid Graph
Sub Title (in English)
Keyword(1)	Multi-agent PathfindingMulti-agent reinforcement learningDrone delivery
1st Author's Name	Shiyao Ding
1st Author's Affiliation	Kyoto University(Kyoto Univ.)
2nd Author's Name	Hideki Aoyama
2nd Author's Affiliation	Panasonic Corporation(Panasonic)
3rd Author's Name	Donghui Lin
3rd Author's Affiliation	Kyoto University(Kyoto Univ.)
Date	2021-08-27
Paper #	SWIM2021-15,SC2021-13
Volume (vol)	vol.121
Number (no)	SWIM-156,SC-157
Page	pp.pp.13-17(SWIM), pp.13-17(SC),
#Pages	5
Date of Issue	2021-08-20 (SWIM, SC)