Presentation | 2019-07-10 [Invited Talk] Current Status of Reinforcement Learning Shin-ichi Maeda, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | Reinforcement Learning is a framework to optimize an action sequence in terms of the return maximization. In this talk, I will explain the theoretical background of the major reinforcement learning algorithms and its applicability to the real problems by introducing some examples. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | Reinforcement Learning / Bellman Equation / Markov Decision Process |
Paper # | RCC2019-18,NS2019-51,RCS2019-108,SR2019-27,SeMI2019-27 |
Date of Issue | 2019-07-03 (RCC, NS, RCS, SR, SeMI) |
Conference Information | |
Committee | SeMI / RCS / NS / SR / RCC |
---|---|
Conference Date | 2019/7/10(3days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | I-Site Nanba(Osaka) |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | Communication and Networked Control for the Future Radio of the AI Age, etc |
Chair | Susumu Ishihara(Shizuoka Univ.) / Tomoaki Otsuki(Keio Univ.) / Yoshikatsu Okazaki(NTT) / Masayuki Ariyoshi(NEC) / Kazunori Hayashi(Osaka City Univ.) |
Vice Chair | Kazuya Monden(Hitachi) / Koji Yamamoto(Kyoto Univ.) / Satoshi Suyama(NTT DoCoMo) / Fumiaki Maehara(Waseda Univ.) / Toshihiko Nishimura(Hokkaido Univ.) / Akihiro Nakao(Univ. of Tokyo) / Suguru Kameda(Tohoku Univ.) / Osamu Takyu(Shinshu Univ.) / Kentaro Ishidu(NICT) / Shunichi Azuma(Nagoya Univ.) / HUAN-BANG LI(NICT) |
Secretary | Kazuya Monden(Kyoto Univ.) / Koji Yamamoto(NTT DOCOMO) / Satoshi Suyama(Hitachi) / Fumiaki Maehara(NTT) / Toshihiko Nishimura(Kyushu Univ.) / Akihiro Nakao(Osaka Pref Univ.) / Suguru Kameda(NTT) / Osamu Takyu(ATR) / Kentaro Ishidu(Univ. of Electro-Comm.) / Shunichi Azuma(Mie Univ.) / HUAN-BANG LI(Kagawa Univ.) |
Assistant | Akira Uchiyama(Osaka Univ.) / Kenji Kanai(Waseda Univ.) / Masafumi Hashimoto(Osaka Univ.) / Kazushi Muraoka(NTT DOCOMO) / Shinsuke Ibi(Doshisha Univ.) / Koichi Adachi(Univ. of Electro-Comm.) / Osamu Nakamura(Sharp) / Shinya Kumagai(Fujitsu) / Shinya Kawano(NTT) / Mai Ohta(Fukuoka Univ.) / Teppei Oyama(Fujitsu) / Kentaro Kobayashi(Nagoya Univ.) / Toshinori Kagawa(NICT) / Masateru Ogura(NAIST) |
Paper Information | |
Registration To | Technical Committee on Sensor Network and Mobile Intelligence / Technical Committee on Radio Communication Systems / Technical Committee on Network Systems / Technical Committee on Smart Radio / Technical Committee on Reliable Communication and Control |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | [Invited Talk] Current Status of Reinforcement Learning |
Sub Title (in English) | Algorithms and Applications |
Keyword(1) | Reinforcement Learning |
Keyword(2) | Bellman Equation |
Keyword(3) | Markov Decision Process |
1st Author's Name | Shin-ichi Maeda |
1st Author's Affiliation | Preferred Networks(PFN) |
Date | 2019-07-10 |
Paper # | RCC2019-18,NS2019-51,RCS2019-108,SR2019-27,SeMI2019-27 |
Volume (vol) | vol.119 |
Number (no) | RCC-106,NS-107,RCS-108,SR-109,SeMI-110 |
Page | pp.pp.39-39(RCC), pp.49-49(NS), pp.43-43(RCS), pp.49-49(SR), pp.53-53(SeMI), |
#Pages | 1 |
Date of Issue | 2019-07-03 (RCC, NS, RCS, SR, SeMI) |