Behavioral Diversity Generation in Autonomous Exploration through Reuse of Past Experience

Benureau, Fabien C. Y.; Oudeyer, Pierre-Yves

doi:10.3389/frobt.2016.00008

Cited by 20 publications

(9 citation statements)

References 46 publications

(69 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The concepts introduced with BR-Evolution have also later been employed in the Novelty-based Evolutionary Babbling (Nov-EB) [27] that allows a robot to autonomously discover the possible interactions with objects in its environment. This work draws a first link between the QD-algorithms and the domain of developmental robotics, which is also studied in several other works (see [28] for overview).…”

Section: Gathering and Improving These Solutions Into Collectionsmentioning

confidence: 99%

Quality and Diversity Optimization: A Unifying Modular Framework

Cully

Demiris

2018

IEEE Trans. Evol. Computat.

198

249

View full text Add to dashboard Cite

The optimization of functions to find the best solution according to one or several objectives has a central role in many engineering and research fields. Recently, a new family of optimization algorithms, named Quality-Diversity optimization, has been introduced, and contrasts with classic algorithms. Instead of searching for a single solution, Quality-Diversity algorithms are searching for a large collection of both diverse and high-performing solutions. The role of this collection is to cover the range of possible solution types as much as possible, and to contain the best solution for each type. The contribution of this paper is threefold. Firstly, we present a unifying framework of Quality-Diversity optimization algorithms that covers the two main algorithms of this family (Multi-dimensional Archive of Phenotypic Elites and the Novelty Search with Local Competition), and that highlights the large variety of variants that can be investigated within this family. Secondly, we propose algorithms with a new selection mechanism for Quality-Diversity algorithms that outperforms all the algorithms tested in this paper. Lastly, we present a new collection management that overcomes the erosion issues observed when using unstructured collections. These three contributions are supported by extensive experimental comparisons of Quality-Diversity algorithms on three different experimental scenarios.

show abstract

Section: Gathering and Improving These Solutions Into Collectionsmentioning

confidence: 99%

Quality and Diversity Optimization: A Unifying Modular Framework

Cully

Demiris

2018

IEEE Trans. Evol. Computat.

198

249

View full text Add to dashboard Cite

show abstract

“…Indeed, in many contexts, learning a single pre-defined skill can be difficult as it amounts to searching (the parameters of) a solution with very rare feedback until one is very close to the solution, or with deceptive feedback due to the phenomenon of local minima. A strategy to address these issues is to direct exploration with intrinsic rewards, leading the system to explore a diversity of skills and contingencies which often result in the discovery of new sub-spaces/areas in the problem space, or in mutual skill improvement when exploring one goal/skill provides data that can be used to improve other goals/skills, such as in goal babbling (Baranes and Oudeyer, 2013;Benureau and Oudeyer, 2016) or off-policy reinforcement learning (see the Horde architecture, Sutton et al, 2011). For example, Lehman and Stanley (2011) showed that searching for pure novelty in the behavioural space a robot to find a reward in a maze more efficiently than if it had been searching for behavioural parameters that optimized directly the reward.…”

Section: Intrinsically Motivated Exploration Scaffolds Efficient Multmentioning

confidence: 99%

Intrinsic motivation, curiosity, and learning

Oudeyer¹,

Gottlieb²,

Lopes³

2016

Progress in Brain Research

241

View full text Add to dashboard Cite

This chapter studies the bidirectional causal interactions between curiosity and learning and discusses how understanding these interactions can be leveraged in educational technology applications. First, we review recent results showing how state curiosity, and more generally the experience of novelty and surprise, can enhance learning and memory retention. Then, we discuss how psychology and neuroscience have conceptualized curiosity and intrinsic motivation, studying how the brain can be intrinsically rewarded by novelty, complexity, or other measures of information. We explain how the framework of computational reinforcement learning can be used to model such mechanisms of curiosity. Then, we discuss the learning progress (LP) hypothesis, which posits a positive feedback loop between curiosity and learning. We outline experiments with robots that show how LP-driven attention and exploration can self-organize a developmental learning curriculum scaffolding efficient acquisition of multiple skills/tasks. Finally, we discuss recent work exploiting these conceptual and computational models in educational technologies, showing in particular how intelligent tutoring systems can be designed to foster curiosity and learning.

show abstract

“…At execution time, for a given goal τ , a loss function is defined over the parameterization space through L(θ) = C(τ, D(θ, c)). A black-box optimization algorithm, such as L-BFGS, is then used to optimize this function and find the optimal set of parameters θ (see [3,32,33] for examples of such meta-policy implementations in the IMGEP framework).…”

Section: Meta-policy Mechanismmentioning

confidence: 99%

Intrinsically Motivated Exploration of Learned Goal Spaces

2021

Self Cite

View full text Add to dashboard Cite

Finding algorithms that allow agents to discover a wide variety of skills efficiently and autonomously, remains a challenge of Artificial Intelligence. Intrinsically Motivated Goal Exploration Processes (IMGEPs) have been shown to enable real world robots to learn repertoires of policies producing a wide range of diverse effects. They work by enabling agents to autonomously sample goals that they then try to achieve. In practice, this strategy leads to an efficient exploration of complex environments with high-dimensional continuous actions. Until recently, it was necessary to provide the agents with an engineered goal space containing relevant features of the environment. In this article we show that the goal space can be learned using deep representation learning algorithms, effectively reducing the burden of designing goal spaces. Our results pave the way to autonomous learning agents that are able to autonomously build a representation of the world and use this representation to explore the world efficiently. We present experiments in two environments using population-based IMGEPs. The first experiments are performed on a simple, yet challenging, simulated environment. Then, another set of experiments tests the applicability of those principles on a real-world robotic setup, where a 6-joint robotic arm learns to manipulate a ball inside an arena, by choosing goals in a space learned from its past experience.

show abstract

Behavioral Diversity Generation in Autonomous Exploration through Reuse of Past Experience

Cited by 20 publications

References 46 publications

Quality and Diversity Optimization: A Unifying Modular Framework

Quality and Diversity Optimization: A Unifying Modular Framework

Intrinsic motivation, curiosity, and learning

Intrinsically Motivated Exploration of Learned Goal Spaces

Contact Info

Product

Resources

About