Safe reinforcement learning in high-dimensional continuous observation spaces

Presentation	2021-03-03 Safe reinforcement learning in high-dimensional continuous spaces Takumi Umemoto, Tohgoroh Matsui, Atsuko Mutoh, Koich Moriyama, Inuzuka Nobuhiro
Abstract(in English)	We propose a method that extends CSEQ, a reinforcement learning method for continuous state spaces based on success probability and profit, to higher dimensions. Reinforcement learning is a machine learning approach that learns better behavior through trial and error. EQ is a safe reinforcement learning method, based on success probability and profit, that focuses on learning danger-avoidance behavior, and its effectiveness has been confirmed on problems in two-dimensional continuous observation spaces. We propose a safe reinforcement learning method that handles high-dimensional continuous spaces by using the mean values of latent variables modeled by a VAE.
Keyword(in English)	reinforcement learning / safe reinforcement learning / deep learning / auto encoder
Paper #	IBISML2020-50
Date of Issue	2021-02-23 (IBISML)

Conference Information
Committee	IBISML
Conference Date	2021-03-02 (3 days)
Place (in English)	Online
Topics (in English)	Organized and general sessions on machine learning
Chair	Ichiro Takeuchi(Nagoya Inst. of Tech.)
Vice Chair	Masashi Sugiyama(Univ. of Tokyo) / Koji Tsuda(Univ. of Tokyo)
Secretary	Masashi Sugiyama(AIST) / Koji Tsuda(NTT)
Assistant	Atsuyoshi Nakamura(Hokkaido Univ.) / Shigeyuki Oba(Miidas)

Paper Information
Registration To	Technical Committee on Information-Based Induction Sciences and Machine Learning
Language	JPN
Title (in English)	Safe reinforcement learning in high-dimensional continuous spaces
Sub Title (in English)
Keyword(1)	reinforcement learning
Keyword(2)	safe reinforcement learning
Keyword(3)	deep learning
Keyword(4)	auto encoder
1st Author's Name	Takumi Umemoto
1st Author's Affiliation	Nagoya Institute of Technology(NIT)
2nd Author's Name	Tohgoroh Matsui
2nd Author's Affiliation	Chubu University(Chubu Univ.)
3rd Author's Name	Atsuko Mutoh
3rd Author's Affiliation	Nagoya Institute of Technology(NIT)
4th Author's Name	Koich Moriyama
4th Author's Affiliation	Nagoya Institute of Technology(NIT)
5th Author's Name	Inuzuka Nobuhiro
5th Author's Affiliation	Nagoya Institute of Technology(NIT)
Date	2021-03-03
Paper #	IBISML2020-50
Volume (vol)	vol.120
Number (no)	IBISML-395
Page	pp.55-62 (IBISML)
#Pages	8
Date of Issue	2021-02-23 (IBISML)