2017
DOI: 10.1016/j.artint.2015.02.006

Relational reinforcement learning with guided demonstrations

Abstract: Model-based reinforcement learning is a powerful paradigm for learning tasks in robotics. However, in-depth exploration is usually required and the actions have to be known in advance. Thus, we propose a novel algorithm that integrates the option of requesting teacher demonstrations to learn new domains with fewer action executions and no previous knowledge. Demonstrations allow new actions to be learned and they greatly reduce the amount of exploration required, but they are only requested when they are expec…

Citation Types: 1 supporting, 27 mentioning, 0 contrasting
Year Published: 2018-2024

Cited by 37 publications (28 citation statements)
References 20 publications
“…In contrast, we use the REX-D algorithm [61], which combines relational RL and active demonstration requests. REX-D requests demonstrations only when they can save a lot of time, because the teacher's time is considered very valuable, and it uses autonomous exploration otherwise.…”
Section: High-level Planning System and Execution (mentioning)
confidence: 99%
“…We introduced the REX-D algorithm [61] to address the learning phase, which is an efficient model-based reinforcement learning (RL) method combined with additional human demonstrations upon request. It can take one of three alternative strategies: one is to explore the state space to improve the model and achieve better rewards in the long term; another is to exploit the available knowledge by executing the manipulations that maximize the reward with the current learned model [76]; and the last one is to request a demonstration from the teacher [60].…”
Section: B. Learning on the Planning Level (mentioning)
confidence: 99%
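
The two excerpts above describe the same decision REX-D makes at each step: exploit the learned model, explore autonomously, or ask the teacher for a demonstration when that is expected to save substantial time. The following is a minimal illustrative sketch of that three-way choice; all names, thresholds, and the cost comparison are hypothetical stand-ins, not the actual REX-D implementation.

```python
# Hypothetical sketch of the explore / exploit / request-demonstration
# decision described in the citation statements above. None of these
# names or thresholds come from the REX-D paper; they are illustrative.

import random


def choose_strategy(model_confidence: float,
                    expected_exploration_cost: float,
                    demonstration_cost: float) -> str:
    """Pick one of the three strategies mentioned in the excerpts.

    model_confidence: how well the learned relational model explains
        past action outcomes (0 = unknown domain, 1 = fully learned).
    expected_exploration_cost: estimated number of action executions
        needed to learn the missing knowledge autonomously.
    demonstration_cost: fixed cost assigned to the teacher's time.
    """
    if model_confidence > 0.9:
        # The model is reliable: exploit it by acting greedily.
        return "exploit"
    if expected_exploration_cost > demonstration_cost:
        # Exploring would cost more than asking the teacher, so
        # request a demonstration; demonstration_cost is set high
        # because the teacher's time is considered very valuable.
        return "request_demonstration"
    # Otherwise, explore autonomously to improve the model.
    return "explore"


if __name__ == "__main__":
    random.seed(0)
    for step in range(5):
        confidence = random.random()
        exploration_cost = random.uniform(1, 20)
        print(step, choose_strategy(confidence, exploration_cost,
                                    demonstration_cost=10.0))
```

Per the citations, the key design point is that the demonstration cost is set high, so the teacher is only queried when autonomous exploration would be clearly more expensive.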
“…Generally, in PS approaches, the robot is taught an initial trajectory that is then improved through autonomously generated rollouts and policy updates [7]. Model-based RL proved to be efficient for learning action sequences with the user in the role of a teacher [19]. Instead of applying RL to teach the robot new actions and their effects, in this work RL is applied to modify the robot trajectory segment selected by the user.…”
Section: Relevant Work (mentioning)
confidence: 99%
“…MDP is also used to clear objects from a table in fully observable problems with uncertainty [27]. The same authors employ the REX-D algorithm, which integrates active teacher demonstrations to increase learning speed, in order to sweep lentils from a plane [28]. An interactive RL approach with contextual affordances is developed by Cruz et al. to clean a table using state-action-reward-state-action (SARSA) [7].…”
(mentioning)
confidence: 99%