[招待論文]観測軌道を基にした運動の最適化 : 起立運動,歩行運動への適用(PRMU&NCテーマセッション(3) : 模倣学習)(認識と学習,模倣学習)

Presentation	2003/10/17 Optimization of observed nominal trajectories : Application to stand-up and biped walking movements Jun MORIMOTO, Kenji DOYA, Christopher G. ATKESON,
PDF Download Page	PDF download Page Link
Abstract(in Japanese)	(See Japanese page)
Abstract(in English)	In imitation learning framework, because a learner can not detect teacher's policy or exact torque output directly from observed trajectories, the learner can not easily generate target movements by only observing trajectories of teacher's motions. In such a case, the learner needs to optimize the movements based on observed trajectories with considering own dynamics. In this study, we propose to use hierarchical reinforcement learning (HRL) and differential dynamic programming (DDP) to optimize trajectories. As a first example, we apply HRL to stand-up task using 2-ioint 3-link robot model. We show that by using HRL in imitation learning framework, the learner acquire the stand-up movement faster than without using observation of teacher's trajectories. As a second example, we apply DDP to biped walking task using 5-link biped robot model. We show that the learner using robust DDP can generate low-torque and robust biped walking trajectories than a PD servo controller.
Keyword(in Japanese)	(See Japanese page)
Keyword(in English)	reinforcement learning / differential dynamic programming / stand-up / biped locomotion / imitation learning
Paper #	PRMU2003-135,NC2003-66
Date of Issue

Paper Information
Registration To	Pattern Recognition and Media Understanding (PRMU)
Language	JPN
Title (in Japanese)	(See Japanese page)
Sub Title (in Japanese)	(See Japanese page)
Title (in English)	Optimization of observed nominal trajectories : Application to stand-up and biped walking movements
Sub Title (in English)
Keyword(1)	reinforcement learning
Keyword(2)	differential dynamic programming
Keyword(3)	stand-up
Keyword(4)	biped locomotion
Keyword(5)	imitation learning
1st Author's Name	Jun MORIMOTO
1st Author's Affiliation	ATR Computational Neurscience Labs()
2nd Author's Name	Kenji DOYA
2nd Author's Affiliation	ATR Computational Neurscience Labs:CREST, JST
3rd Author's Name	Christopher G. ATKESON
3rd Author's Affiliation	The Robotics Institute, Carnegie Mellon University
Date	2003/10/17
Paper #	PRMU2003-135,NC2003-66
Volume (vol)	vol.103
Number (no)	390
Page	pp.pp.-
#Pages	6
Date of Issue