Presentation | 2021-03-03 Markov Decision Processes for Simultaneous Control of Multiple Objects with Different State Transition Probabilities in Each Cluster Yuto Motomura, Akira Kamatsuka, Koki Kazama, Toshiyasu Matsushima, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | In this study, we propose an extended MDP model, which is a Markov decision process model with multiple control objects and different state transition probabilities for each cluster. In the model, we consider problems of deriving policies from historical data of each objects which are collected in advance. We formulate the problems based on the Bayesian decision theory for both known and unknown state transition probabilities. Then we derive the optimal solutions of the problems. Finally, we propose calculation algorithms for the optimal solutions. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | Markov decision processes / Bayesian decision theory / machine learning / collective control / recommender system / recommendation system |
Paper # | IBISML2020-49 |
Date of Issue | 2021-02-23 (IBISML) |
Conference Information | |
Committee | IBISML |
---|---|
Conference Date | 2021/3/2(3days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | Online |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | Organized and general sessions on machine learning |
Chair | Ichiro Takeuchi(Nagoya Inst. of Tech.) |
Vice Chair | Masashi Sugiyama(Univ. of Tokyo) / Koji Tsuda(Univ. of Tokyo) |
Secretary | Masashi Sugiyama(AIST) / Koji Tsuda(NTT) |
Assistant | Atsuyoshi Nakamura(Hokkaido Univ.) / Shigeyuki Oba(Miidas) |
Paper Information | |
Registration To | Technical Committee on Infomation-Based Induction Sciences and Machine Learning |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Markov Decision Processes for Simultaneous Control of Multiple Objects with Different State Transition Probabilities in Each Cluster |
Sub Title (in English) | |
Keyword(1) | Markov decision processes |
Keyword(2) | Bayesian decision theory |
Keyword(3) | machine learning |
Keyword(4) | collective control |
Keyword(5) | recommender system |
Keyword(6) | recommendation system |
1st Author's Name | Yuto Motomura |
1st Author's Affiliation | Waseda University(Waseda Univ.) |
2nd Author's Name | Akira Kamatsuka |
2nd Author's Affiliation | Waseda University(Waseda Univ.) |
3rd Author's Name | Koki Kazama |
3rd Author's Affiliation | Waseda University(Waseda Univ.) |
4th Author's Name | Toshiyasu Matsushima |
4th Author's Affiliation | Waseda University(Waseda Univ.) |
Date | 2021-03-03 |
Paper # | IBISML2020-49 |
Volume (vol) | vol.120 |
Number (no) | IBISML-395 |
Page | pp.pp.47-54(IBISML), |
#Pages | 8 |
Date of Issue | 2021-02-23 (IBISML) |