Solving POMDPs using Restricted Boltzmann Machines with Echo State Networks

Otsuka,Makoto; Yoshimoto,Junichiro; Elfwing,Stefan; Doya,Kenji

IEICE Technical Committee Submission System
Conference Paper's Information

Online Proceedings
[Sign in]
Tech. Rep. Archives

Paper Abstract and Keywords
Presentation		2011-06-24 16:30 Solving POMDPs using Restricted Boltzmann Machines with Echo State Networks Makoto Otsuka, Junichiro Yoshimoto, Stefan Elfwing, Kenji Doya (OIST) NC2011-19
Abstract	(in Japanese)	(See Japanese page)
	(in English)	A partially observable Markov decision process (POMDP) can be solved in a model-based way using explicit knowledge of the environmental dynamics or in a model-free way using implicit representations of task-relevant states. Here we consider a model-free approach of combining an echo state network (ESN) for summarizing past actions and observations and a restricted Boltzmann machine (RBM) for learning action values in a high-dimensional state space. Simulation results in robot navigation tasks showed that the ESN can capture relevant information in the sequence of high dimensional observations and that RBM can construct task-oriented internal representation in its hidden layer.
Keyword	(in Japanese)	(See Japanese page)
	(in English)	partially observable Markov decision processes / restricted Boltzmann machines / echo state networks / goal-directed representation / free energy / / /
Reference Info.		IEICE Tech. Rep., vol. 111, no. 96, NC2011-19, pp. 143-148, June 2011.
Paper #		NC2011-19
Date of Issue		2011-06-16 (NC)
ISSN		Print edition: ISSN 0913-5685 Online edition: ISSN 2432-6380
Copyright and reproduction		All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034)
Download PDF		NC2011-19

Conference Information
Committee	NC IPSJ-BIO
Conference Date	2011-06-23 - 2011-06-24
Place (in Japanese)	(See Japanese page)
Place (in English)	50th Anniversary Memorial Hall, University of the Ryukyus
Topics (in Japanese)	(See Japanese page)
Topics (in English)	Machine Learning Approach to Biodata Mining, and General
Paper Information
Registration To	NC
Conference Code	2011-06-NC-BIO
Language	English
Title (in Japanese)	(See Japanese page)
Sub Title (in Japanese)	(See Japanese page)
Title (in English)	Solving POMDPs using Restricted Boltzmann Machines with Echo State Networks
Sub Title (in English)
Keyword(1)	partially observable Markov decision processes
Keyword(2)	restricted Boltzmann machines
Keyword(3)	echo state networks
Keyword(4)	goal-directed representation
Keyword(5)	free energy
Keyword(6)
Keyword(7)
Keyword(8)
1st Author's Name	Makoto Otsuka
1st Author's Affiliation	Okinawa Institute of Science and Technology (OIST)
2nd Author's Name	Junichiro Yoshimoto
2nd Author's Affiliation	Okinawa Institute of Science and Technology (OIST)
3rd Author's Name	Stefan Elfwing
3rd Author's Affiliation	Okinawa Institute of Science and Technology (OIST)
4th Author's Name	Kenji Doya
4th Author's Affiliation	Okinawa Institute of Science and Technology (OIST)
5th Author's Name
5th Author's Affiliation	()
6th Author's Name
6th Author's Affiliation	()
7th Author's Name
7th Author's Affiliation	()
8th Author's Name
8th Author's Affiliation	()
9th Author's Name
9th Author's Affiliation	()
10th Author's Name
10th Author's Affiliation	()
11th Author's Name
11th Author's Affiliation	()
12th Author's Name
12th Author's Affiliation	()
13th Author's Name
13th Author's Affiliation	()
14th Author's Name
14th Author's Affiliation	()
15th Author's Name
15th Author's Affiliation	()
16th Author's Name
16th Author's Affiliation	()
17th Author's Name
17th Author's Affiliation	()
18th Author's Name
18th Author's Affiliation	()
19th Author's Name
19th Author's Affiliation	()
20th Author's Name
20th Author's Affiliation	()
Speaker	Author-1
Date Time	2011-06-24 16:30:00
Presentation Time	25 minutes
Registration for	NC
Paper #	NC2011-19
Volume (vol)	vol.111
Number (no)	no.96
Page	pp.143-148
#Pages	6
Date of Issue	2011-06-16 (NC)

[Return to Top Page]

[Return to IEICE Web Page]

The Institute of Electronics, Information and Communication Engineers (IEICE), Japan