Christian Daniel scite author profile

Movement Primitives are a well-established\ud paradigm for modular movement representation and\ud generation. They provide a data-driven representation\ud of movements and support generalization to novel situations,\ud temporal modulation, sequencing of primitives\ud and controllers for executing the primitive on physical\ud systems. However, while many MP frameworks exhibit\ud some of these properties, there is a need for a uni-\ud fied framework that implements all of them in a principled\ud way. In this paper, we show that this goal can be\ud achieved by using a probabilistic representation. Our\ud approach models trajectory distributions learned from\ud stochastic movements. Probabilistic operations, such as\ud conditioning can be used to achieve generalization to\ud novel situations or to combine and blend movements in\ud a principled way. We derive a stochastic feedback controller\ud that reproduces the encoded variability of the\ud movement and the coupling of the degrees of freedom\ud of the robot. We evaluate and compare our approach\ud on several simulated and real robot scenarios

show abstract

Towards learning hierarchical skills for multi-phase manipulation tasks

Kroemer

Daniel

Neumann

et al. 2015

103

View full text Add to dashboard Cite

Abstract-Most manipulation tasks can be decomposed into a sequence of phases, where the robot's actions have different effects in each phase. The robot can perform actions to transition between phases and, thus, alter the effects of its actions, e.g. grasp an object in order to then lift it. The robot can thus reach a phase that affords the desired manipulation.In this paper, we present an approach for exploiting the phase structure of tasks in order to learn manipulation skills more efficiently. Starting with human demonstrations, the robot learns a probabilistic model of the phases and the phase transitions. The robot then employs model-based reinforcement learning to create a library of motor primitives for transitioning between phases. The learned motor primitives generalize to new situations and tasks. Given this library, the robot uses a value function approach to learn a high-level policy for sequencing the motor primitives. The proposed method was successfully evaluated on a real robot performing a bimanual grasping task.

show abstract

Probabilistic inference for determining options in reinforcement learning

et al. 2016

View full text Add to dashboard Cite

Tasks that require many sequential decisions or complex solutions are hard to solve using conventional reinforcement learning algorithms. Based on the semi Markov decision process setting (SMDP) and the option framework, we propose a model which aims to alleviate these concerns. Instead of learning a single monolithic policy, the agent learns a set of simpler sub-policies as well as the initiation and termination probabilities for each of those sub-policies. While existing option learning algorithms frequently require manual specification of components such as the sub-policies, we present an algorithm which infers all relevant components of the option framework from data. Furthermore, the proposed approach is based on parametric option representations and works well in combination with current policy search methods, which are particularly well suited for continuous real-world tasks. We present results on SMDPs with discrete as well as continuous state-action spaces. The results show that the presented algorithm can combine simple sub-policies to solve complex tasks and can improve learning performance on simpler tasks.

show abstract

Active Reward Learning

Daniel

Viering

Metz

et al. 2014

View full text Add to dashboard Cite

While reward functions are an essential component of many robot learning methods, defining such functions remains a hard problem in many practical applications. For tasks such as grasping, there are no reliable success measures available. Defining reward functions by hand requires extensive task knowledge and often leads to undesired emergent behavior. Instead, we propose to learn the reward function through active learning, querying human expert knowledge for a subset of the agent's rollouts. We introduce a framework, wherein a traditional learning algorithm interplays with the reward learning component, such that the evolution of the action learner guides the queries of the reward learner. We demonstrate results of our method on a robot grasping task and show that the learned reward function generalizes to a similar task.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.