Presentation 2022-12-21
Agent based Modeling and Reinforcement Learning for optimal allocation of resources
Rashmi Tilak, Toshiharu Sugawara,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) We propose a model and notation for business process for delivery of parcels using drones and attempt to improve the total efficiency of the process using reinforcement learning. Although modeling the business processing is one of important applications of multi-agent systems, it is a challenge to design and control the processing efficiently. For this purpose, we train several drones in a way that helps them determine the right number of resources keeping in view the main factors using reinforcement learning. We also examine the use of two types of Q learning algorithms --- temporal difference (TD) and SARSA --- and investigate the difference between the learned behaviors using them, such as utilization percentage of drones, queue size of packages, idle drones. We show that the trained model outperforms the model with no learning involved and SARSA results in the better performance due to their safer learning.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) drones / SARSA / Reinforcement learning / Warehouse delivery
Paper # AI2022-45
Date of Issue 2022-12-14 (AI)

Conference Information
Committee AI
Conference Date 2022/12/21(1days)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair Yuichi Sei(Univ. of Electro-Comm.)
Vice Chair Yuko Sakurai(AIST) / Tadachika Ozono(Nagoya Inst. of Tech.)
Secretary Yuko Sakurai(Tokyo Univ. of Agriculture and Technology) / Tadachika Ozono(Toho Univ.)
Assistant Kazutaka Matsuzaki(Chuo Univ.)

Paper Information
Registration To Technical Committee on Artificial Intelligence and Knowledge-Based Processing
Language ENG
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Agent based Modeling and Reinforcement Learning for optimal allocation of resources
Sub Title (in English)
Keyword(1) drones
Keyword(2) SARSA
Keyword(3) Reinforcement learning
Keyword(4) Warehouse delivery
1st Author's Name Rashmi Tilak
1st Author's Affiliation Waseda University(Waseda University)
2nd Author's Name Toshiharu Sugawara
2nd Author's Affiliation Waseda University(Waseda University)
Date 2022-12-21
Paper # AI2022-45
Volume (vol) vol.122
Number (no) AI-322
Page pp.pp.68-73(AI),
#Pages 6
Date of Issue 2022-12-14 (AI)