Agent based Modeling and Reinforcement Learning for optimal allocation of resources

Presentation	2022-12-21 Agent based Modeling and Reinforcement Learning for optimal allocation of resources Rashmi Tilak, Toshiharu Sugawara
Abstract(in Japanese)	(See Japanese page)
Abstract(in English)	We propose a model and notation for the business process of parcel delivery by drones and attempt to improve the overall efficiency of the process using reinforcement learning. Although modeling business processes is one of the important applications of multi-agent systems, designing and controlling such processes efficiently is a challenge. To this end, we use reinforcement learning to train several drones so that they can determine the right number of resources while taking the main factors into account. We also examine two types of Q-learning algorithms, temporal difference (TD) and SARSA, and investigate the differences between the behaviors learned with them, such as the utilization percentage of drones, the queue size of packages, and the number of idle drones. We show that the trained model outperforms a model with no learning, and that SARSA yields better performance owing to its safer learning.
Keyword(in Japanese)	(See Japanese page)
Keyword(in English)	drones / SARSA / Reinforcement learning / Warehouse delivery
Paper #	AI2022-45
Date of Issue	2022-12-14 (AI)

Conference Information
Committee	AI
Conference Date	2022/12/21 (1 day)
Place (in Japanese)	(See Japanese page)
Place (in English)
Topics (in Japanese)	(See Japanese page)
Topics (in English)
Chair	Yuichi Sei(Univ. of Electro-Comm.)
Vice Chair	Yuko Sakurai(AIST) / Tadachika Ozono(Nagoya Inst. of Tech.)
Secretary	Yuko Sakurai(Tokyo Univ. of Agriculture and Technology) / Tadachika Ozono(Toho Univ.)
Assistant	Kazutaka Matsuzaki(Chuo Univ.)

Paper Information
Registration To	Technical Committee on Artificial Intelligence and Knowledge-Based Processing
Language	ENG
Title (in Japanese)	(See Japanese page)
Sub Title (in Japanese)	(See Japanese page)
Title (in English)	Agent based Modeling and Reinforcement Learning for optimal allocation of resources
Sub Title (in English)
Keyword(1)	drones
Keyword(2)	SARSA
Keyword(3)	Reinforcement learning
Keyword(4)	Warehouse delivery
1st Author's Name	Rashmi Tilak
1st Author's Affiliation	Waseda University
2nd Author's Name	Toshiharu Sugawara
2nd Author's Affiliation	Waseda University
Date	2022-12-21
Paper #	AI2022-45
Volume (vol)	vol.122
Number (no)	AI-322
Page	pp.68-73(AI)
#Pages	6
Date of Issue	2022-12-14 (AI)