Presentation | 2003/10/17 Optimization of observed nominal trajectories : Application to stand-up and biped walking movements Jun MORIMOTO, Kenji DOYA, Christopher G. ATKESON, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | In imitation learning framework, because a learner can not detect teacher's policy or exact torque output directly from observed trajectories, the learner can not easily generate target movements by only observing trajectories of teacher's motions. In such a case, the learner needs to optimize the movements based on observed trajectories with considering own dynamics. In this study, we propose to use hierarchical reinforcement learning (HRL) and differential dynamic programming (DDP) to optimize trajectories. As a first example, we apply HRL to stand-up task using 2-ioint 3-link robot model. We show that by using HRL in imitation learning framework, the learner acquire the stand-up movement faster than without using observation of teacher's trajectories. As a second example, we apply DDP to biped walking task using 5-link biped robot model. We show that the learner using robust DDP can generate low-torque and robust biped walking trajectories than a PD servo controller. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | reinforcement learning / differential dynamic programming / stand-up / biped locomotion / imitation learning |
Paper # | PRMU2003-135,NC2003-66 |
Date of Issue |
Conference Information | |
Committee | PRMU |
---|---|
Conference Date | 2003/10/17(1days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Pattern Recognition and Media Understanding (PRMU) |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Optimization of observed nominal trajectories : Application to stand-up and biped walking movements |
Sub Title (in English) | |
Keyword(1) | reinforcement learning |
Keyword(2) | differential dynamic programming |
Keyword(3) | stand-up |
Keyword(4) | biped locomotion |
Keyword(5) | imitation learning |
1st Author's Name | Jun MORIMOTO |
1st Author's Affiliation | ATR Computational Neurscience Labs() |
2nd Author's Name | Kenji DOYA |
2nd Author's Affiliation | ATR Computational Neurscience Labs:CREST, JST |
3rd Author's Name | Christopher G. ATKESON |
3rd Author's Affiliation | The Robotics Institute, Carnegie Mellon University |
Date | 2003/10/17 |
Paper # | PRMU2003-135,NC2003-66 |
Volume (vol) | vol.103 |
Number (no) | 390 |
Page | pp.pp.- |
#Pages | 6 |
Date of Issue |