2013
DOI: 10.3389/fnbot.2013.00022
Which is the best intrinsic motivation signal for learning multiple skills?

Abstract: Humans and other biological agents are able to autonomously learn and cache different skills in the absence of any biological pressure or any assigned task. In this respect, Intrinsic Motivations (i.e., motivations not connected to reward-related stimuli) play a cardinal role in animal learning and can be considered a fundamental tool for developing more autonomous and more adaptive artificial agents. In this work, we provide an exhaustive analysis of a scarcely investigated problem: which kind of IM reinf…

Cited by 53 publications (55 citation statements)
References 34 publications
“…The learning progress in achieving a goal can be used as a transient reward so that the system focuses on the tasks where it is learning the most, moving on to other ones when the task-related skill has been completely learnt or when more promising activities become available [11], [12]. This strategy allows the learning of multiple separate skills, and possibly a dynamic transfer of knowledge between tasks that require similar policies [13].…”
Section: Introduction
confidence: 99%
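The learning-progress signal described in this statement can be sketched as a transient intrinsic reward: a running estimate of how fast a task's prediction error is falling. This is a minimal illustration only; the class name, window size, and error values are hypothetical and not taken from the cited works.

```python
from collections import deque

class LearningProgressReward:
    """Transient intrinsic reward: the recent drop in a task's prediction error.

    The reward is high while the skill is improving and decays to zero once
    the skill has been learnt, so the system naturally moves on to more
    promising activities.
    """

    def __init__(self, window=3):
        self.window = window
        self.errors = deque(maxlen=2 * window)  # two adjacent windows of errors

    def update(self, prediction_error):
        self.errors.append(prediction_error)
        if len(self.errors) < 2 * self.window:
            return 0.0  # not enough history yet
        older = sum(list(self.errors)[: self.window]) / self.window
        newer = sum(list(self.errors)[self.window:]) / self.window
        return max(0.0, older - newer)  # positive only while error is falling

lp = LearningProgressReward(window=3)
rewards = [lp.update(e) for e in (1.0, 0.9, 0.8, 0.4, 0.3, 0.2, 0.2, 0.2)]
```

While the error is dropping the reward is positive; once performance plateaus the reward returns to zero, making other tasks relatively more attractive.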
“…The matching value is used to determine the probability distribution over the goals that supports the selection of the goal to pursue (number 2 in the figure). The probability distribution is generated on the basis of intrinsic motivations, for example related to competence (e.g., the goals with a lower competence, or with a higher competence-improvement, have a higher probability of selection; Santucci et al, 2013).…”
Section: Agent
confidence: 99%
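The competence-based probability distribution described above can be illustrated with a softmax over per-goal competence improvement. This is only a sketch: the goal names, improvement values, and temperature are made up for illustration.

```python
import math

def goal_selection_probs(improvement, temperature=0.1):
    """Map each goal's recent competence improvement to a selection probability.

    Goals whose competence is improving fastest are selected most often;
    already-mastered goals (improvement near zero) are selected rarely.
    """
    weights = {g: math.exp(dc / temperature) for g, dc in improvement.items()}
    total = sum(weights.values())
    return {g: w / total for g, w in weights.items()}

probs = goal_selection_probs({"reach": 0.00, "grasp": 0.25, "stack": 0.10})
# "grasp", with the highest competence improvement, gets the largest probability
```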
“…Settings and references:

Reinforcement Learning: Barto et al., 2004; Simşek and Barto, 2006; Schembri et al., 2007; Sequeira et al., 2011; Kompella et al., 2012; Baldassarre and Mirolli, 2013; Metzen and Kirchner, 2013; Di Nocera et al., 2014; Frank et al., 2015; Hester and Stone, 2015
Deep Learning: Mohamed and Rezende, 2015; Kulkarni et al., 2016; Achiam and Sastry, 2017; Zhelo et al., 2018
Hierarchical Structure: Schembri et al., 2007; Baranes and Oudeyer, 2010; Baldassarre and Mirolli, 2013; Santucci et al., 2013; Frank et al., 2015; Kulkarni et al., 2016
Active Learning: Baranes and Oudeyer, 2009, 2010; Kompella et al., 2017; Pathak et al., 2017
Motion Planning: Frank et al., 2015
Affordance Discovery: Hart et al., 2008; Hart, 2009
Goal Discovery/Goal Generation

In Reinforcement Learning (RL), an agent learns from experience as it deals with a sequential decision problem. The agent interacts with an "environment" which contains a "critic" that provides the agent with rewards by evaluating its behavior.…”
Section: Settings References
confidence: 99%
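The agent/environment/critic loop described in the quoted passage can be sketched with tabular Q-learning on a toy chain environment. Everything here is illustrative: the chain task, constants, and function names are assumptions, not taken from the cited works.

```python
import random

def q_learning(n_states=5, n_actions=2, episodes=200,
               alpha=0.5, gamma=0.9, eps=0.1):
    """Tabular Q-learning: the agent acts, the environment transitions, and
    the 'critic' evaluates behavior by emitting a reward (+1 for reaching
    the last state of a small chain)."""
    Q = [[0.0] * n_actions for _ in range(n_states)]

    def step(s, a):  # environment dynamics: action 1 moves right, 0 moves left
        s2 = min(s + 1, n_states - 1) if a == 1 else max(s - 1, 0)
        return s2, (1.0 if s2 == n_states - 1 else 0.0)  # critic's evaluation

    def greedy(s):  # argmax over Q[s] with random tie-breaking
        best = max(Q[s])
        return random.choice([a for a in range(n_actions) if Q[s][a] == best])

    for _ in range(episodes):
        s = 0
        while s != n_states - 1:
            a = random.randrange(n_actions) if random.random() < eps else greedy(s)
            s2, r = step(s, a)
            Q[s][a] += alpha * (r + gamma * max(Q[s2]) - Q[s][a])  # TD update
            s = s2
    return Q
```

After training, the action values for moving toward the rewarded state dominate at every state: the critic's evaluations have shaped the agent's policy.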