A User Study on Robot Skill Learning Without a Cost Function: Optimization of Dynamic Movement Primitives via Naive User Feedback

Vollmer, Anna-Lisa; Hemion, Nikolas

doi:10.3389/frobt.2018.00077

Cited by 7 publications

(9 citation statements)

References 48 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Therefore, we present a system that does not need a predefined cost function or feature representation, and can learn successful movement skills from non-expert users in a couple of minutes. In contrast to work done by Vollmer and Hemion [3], we have looked into the drawbacks of using absolute-scale feedback from users, which is influenced by a drift in evaluation and the requirement of anchoring to a reference point, by utilizing preference-based feedback from users.…”

Section: A Related Workmentioning

confidence: 99%

“…The goal of the game is to catch the ball with the cup through skillful movement. Kober and Peters [19] have demonstrated that the cup-and-ball movement can be learned by a robot arm using DMP-based optimization, and Vollmer and Hemion [3] have demonstrated that Pepper is capable of mastering the game with human absolute scale ratings as reward. In this study, the cup-and-ball toy was built such that the size of the cup and ball resulted in a level of difficulty suitable for our purposes.…”

Section: Study 1: Human Feedbackmentioning

confidence: 99%

“…Therefore, this work investigates the applicability of an optimization system from a user-centered perspective and investigates what kind of user feedback for the optimization is intuitively usable without much effort. We base our work on recent work by Vollmer and Hemion [3], who have shown that naive users can teach robots complex continuous movement skills via a simple user interface. We here also concentrate on robot learning for complex movement skills with a human teacher and compare the types of feedback a teacher could give as a performance measure: feedback as star ratings on an absolute scale for single roll-outs (as in [3]) versus preference-based feedback for pairwise comparisons.…”

Section: Introductionmentioning

confidence: 99%

“…We base our work on recent work by Vollmer and Hemion [3], who have shown that naive users can teach robots complex continuous movement skills via a simple user interface. We here also concentrate on robot learning for complex movement skills with a human teacher and compare the types of feedback a teacher could give as a performance measure: feedback as star ratings on an absolute scale for single roll-outs (as in [3]) versus preference-based feedback for pairwise comparisons. In the following we will refer to the two conditions as 'absolute scale' and 'preference-based', respectively.…”

Section: Introductionmentioning

confidence: 99%

“…6]. This is supported by the number of users who were not able to successfully teach the robot, because their strategy was not compatible with the properties of the underlying learning algorithm in [3]. On the other hand, outside of interactive task learning, people have been shown to be very proficient at giving preference-based feedback and at comparing things [7].…”

Section: Introductionmentioning

confidence: 99%

See 4 more Smart Citations

Interactive Robot Task Learning: Human Teaching Proficiency With Different Feedback Approaches

Hindemith

Bruns

Noller

et al. 2023

IEEE Trans. Cogn. Dev. Syst.

Self Cite

View full text Add to dashboard Cite

The deployment of versatile robot systems in diverse environments requires intuitive approaches for humans to flexibly teach them new skills. In our present work, we investigate different user feedback types to teach a real robot a new movement skill. We compare feedback as star ratings on an absolute scale for single roll-outs versus preference-based feedback for pairwise comparisons with respective optimization algorithms (i.e., a variation of co-variance matrix adaptationevolution strategy (CMA-ES) and random optimization) to teach the robot the game of skill cup-and-ball. In an experimental investigation with users, we investigated the influence of the feedback type on the user experience of interacting with the different interfaces and the performance of the learning systems. While there is no significant difference for the subjective user experience between the conditions, there is a significant difference in learning performance. The preference-based system learned the task quicker, but this did not influence the users' evaluation of it. In a follow-up study, we confirmed that the difference in learning performance indeed can be attributed to the human users' performance.

show abstract

Section: A Related Workmentioning

confidence: 99%

Section: Study 1: Human Feedbackmentioning

confidence: 99%