2022
DOI: 10.1109/lra.2022.3150024
|View full text |Cite
|
Sign up to set email alerts
|

Residual Learning From Demonstration: Adapting DMPs for Contact-Rich Manipulation

Abstract: Manipulation skills involving contact and friction are inherent to many robotics tasks. Using the class of motor primitives for peg-in-hole like insertions, we study how robots can learn such skills. Dynamic Movement Primitives (DMP) are a popular way of extracting such policies through behaviour cloning (BC) but can struggle in the context of insertion. Policy adaptation strategies such as residual learning can help improve the overall performance of policies in the context of contactrich manipulation. Howeve… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
18
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
3

Relationship

0
9

Authors

Journals

citations
Cited by 30 publications
(19 citation statements)
references
References 32 publications
(58 reference statements)
0
18
0
Order By: Relevance
“…Recent success in deep RL [7], [10], [13], [14] enables a robot to learn complex manipulation tasks such as grasping [7], [8] and insertion [9], [15], [16], [17], [18] driven by a reward. To avoid the requirement of specialist knowledge for reward engineering, several prior works have proposed sample-efficient RL methods that can learn complex manipulation skills from a sparse reward by leveraging a small number of demonstrations for guided exploration [8], [9], [19], [15].…”
Section: Related Workmentioning
confidence: 99%
“…Recent success in deep RL [7], [10], [13], [14] enables a robot to learn complex manipulation tasks such as grasping [7], [8] and insertion [9], [15], [16], [17], [18] driven by a reward. To avoid the requirement of specialist knowledge for reward engineering, several prior works have proposed sample-efficient RL methods that can learn complex manipulation skills from a sparse reward by leveraging a small number of demonstrations for guided exploration [8], [9], [19], [15].…”
Section: Related Workmentioning
confidence: 99%
“…Contact tasks skills learning from demonstration In robotic contact tasks, the robot's position [20], orientation [21], contact force [22][23] and impedance [24][25] [26] information are closely related to the skills involved. Typically, DMPs [27][28] are used to learn skills related to robot position and orientation data, with orientation data commonly represented in the form of quaternions.…”
Section: Related Workmentioning
confidence: 99%
“…Residual policy methods [9], [2] use a pre-program fixed policy while employing RL only to learn a correction policy. LfD is used to derive a baseline policy [10], [1] or to initiate the learning process. While the above-noted approaches have shown significant results in very challenging problems, they are limited by the online RL scheme's ability to collect a large variety of data on the robot and utilize a large collected experience.…”
Section: Related Work a Contact-rich Manipulationmentioning
confidence: 99%