As robots leave the controlled environments of factories to autonomously function in more complex, natural environments 1,2,3, they will have to respond to the inevitable fact that they will become damaged 4,5. However, while animals can quickly adapt to a wide variety of injuries, current robots cannot "think outside the box" to find a compensatory behavior when damaged: they are limited to their pre-specified self-sensing abilities, can diagnose only anticipated failure modes 6, and require a pre-programmed contingency plan for every type of potential damage, an impracticality for complex robots 4,5. Here we introduce an intelligent trial-and-error algorithm that allows robots to adapt to damage in less than two minutes, without requiring self-diagnosis or pre-specified contingency plans. Before deployment, a robot exploits a novel algorithm to create a detailed map of the space of high-performing behaviors: this map represents the robot's intuitions about what behaviors it can perform and their value. If the robot is damaged, it uses these intuitions to guide a trial-and-error learning algorithm that conducts intelligent experiments to rapidly discover a compensatory behavior that works in spite of the damage. Experiments reveal successful adaptations for a legged robot injured in five different ways, including damaged, broken, and missing legs, and for a robotic arm with joints broken in 14 different ways. This new technique will enable more robust, effective, autonomous robots, and suggests principles that animals may use to adapt to injury.

Robots have transformed the economics of many industries, most notably manufacturing 7, and have the power to deliver tremendous benefits to society, such as in search and rescue 8, disaster response 9, health care 3, and transportation 10. They are also invaluable tools for scientific exploration, whether of distant planets 1,4 or deep oceans 2. A major obstacle to their widespread adoption in more complex environments outside of factories is their fragility 4,5: robots presently pale in comparison to natural animals in their ability to invent compensatory behaviors after an injury (Fig. 1A).

Current damage recovery in robots typically involves two phases: self-diagnosis, and then selection of the best pre-designed contingency plan 11,12,13,14. Such self-diagnosing robots are expensive, because self-monitoring sensors are expensive, and difficult to design, because robot engineers cannot foresee every possible situation. This approach often fails either because the diagnosis is incorrect 12,13 or because an appropriate contingency plan is not provided 14. Injured animals respond differently: they learn by trial and error how to compensate for damage (e.g. learning which limp minimizes pain) 15,16. Similarly, trial-and-error learning algorithms could allow robots to creatively discover compensatory behaviors without being limited to their designers' assumptions about how damage may occur and how to compensate for each damage type. However, state-of-the-art ...
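To make the two-phase idea concrete, here is a minimal Python sketch of a behavior-performance map built before deployment and a trial-and-error loop that consults it after damage. The names (`simulate`, `evaluate_on_robot`) and the map-filling scheme are illustrative assumptions; the paper's actual method pairs MAP-Elites with Bayesian optimization, which this sketch only approximates.

```python
import random

# Hypothetical stand-ins: `simulate` maps a controller to a discretized,
# hashable behavior descriptor plus a predicted performance;
# `evaluate_on_robot` runs one physical trial on the (possibly damaged) robot.

def random_controller(n_params=12):
    return [random.uniform(-1, 1) for _ in range(n_params)]

def mutate(controller, sigma=0.1):
    return [p + random.gauss(0, sigma) for p in controller]

def build_map(simulate, iterations=100_000):
    """Offline phase: fill a behavior -> (controller, performance) archive,
    MAP-Elites-style, by mutating elites and keeping per-cell bests."""
    archive = {}
    for _ in range(iterations):
        if archive:
            parent, _ = random.choice(list(archive.values()))
            candidate = mutate(parent)
        else:
            candidate = random_controller()
        descriptor, perf = simulate(candidate)
        if descriptor not in archive or perf > archive[descriptor][1]:
            archive[descriptor] = (candidate, perf)
    return archive

def adapt(archive, evaluate_on_robot, good_enough=0.9, max_trials=20):
    """Online phase: test the most promising cells on the real robot until one
    performs well despite the damage. (The paper refines its expectations with
    a Gaussian process; this sketch simply rules out tested cells.)"""
    expected = {key: perf for key, (_, perf) in archive.items()}
    best, best_perf = None, float("-inf")
    for _ in range(max_trials):
        key = max(expected, key=expected.get)          # most promising cell
        measured = evaluate_on_robot(archive[key][0])  # one physical trial
        expected[key] = float("-inf")                  # do not retest this cell
        if measured > best_perf:
            best, best_perf = archive[key][0], measured
        if measured >= good_enough * archive[key][1]:
            break                                      # compensatory behavior found
    return best
```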
A central biological question is how natural organisms are so evolvable (capable of quickly adapting to new environments). A key driver of evolvability is the widespread modularity of biological networks—their organization as functional, sparsely connected subunits—but there is no consensus regarding why modularity itself evolved. Although most hypotheses assume indirect selection for evolvability, here we demonstrate that the ubiquitous, direct selection pressure to reduce the cost of connections between network nodes causes the emergence of modular networks. Computational evolution experiments with selection pressures to maximize network performance and minimize connection costs yield networks that are significantly more modular and more evolvable than control experiments that only select for performance. These results will catalyse research in numerous disciplines, such as neuroscience and genetics, and enhance our ability to harness evolution for engineering purposes.
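A hedged sketch of the core selection rule follows: Pareto dominance over the two objectives, maximizing performance while minimizing connection cost. The `Network` type and the `performance` function are illustrative placeholders, not the authors' implementation.

```python
from dataclasses import dataclass, field

@dataclass
class Network:
    edges: list = field(default_factory=list)  # (source, target, weight) tuples

def connection_cost(net: Network) -> int:
    # The simplest cost is edge count; wiring length is another option.
    return len(net.edges)

def dominates(a: Network, b: Network, performance) -> bool:
    """Pareto dominance: maximize performance, minimize connection cost."""
    no_worse = (performance(a) >= performance(b)
                and connection_cost(a) <= connection_cost(b))
    strictly_better = (performance(a) > performance(b)
                       or connection_cost(a) < connection_cost(b))
    return no_worse and strictly_better
```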
Evolutionary robotics (ER) aims to automatically design robots or robot controllers without having to describe their inner workings. To reach this goal, ER researchers primarily employ phenotypes that can lead to an infinite number of robot behaviors, and fitness functions that only reward the achievement of the task, not how to achieve it. These choices make ER particularly prone to premature convergence. To tackle this problem, several papers recently proposed to explicitly encourage the diversity of robot behaviors, rather than the diversity of genotypes as in classic evolutionary optimization. Such an approach avoids the need to compute distances between structures and the pitfalls of the noninjectivity of the phenotype/behavior relation; however, it also introduces new questions: how should behaviors be compared? Should this comparison be task-specific? And what is the best way to encourage diversity in this context? In this paper, we review the main published approaches to behavioral diversity and benchmark them in a common framework. We compare each approach on three different tasks and two different genotypes. The results show that fostering behavioral diversity substantially improves the evolutionary process in the investigated experiments, regardless of genotype or task. Among the benchmarked approaches, multi-objective methods were the most efficient, and the generic Hamming-based behavioral distance was at least as efficient as task-specific behavioral metrics.
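As an illustration of the generic Hamming-based behavioral distance the benchmark favors, the following sketch (assumed names, not the paper's code) scores each individual's diversity as its mean Hamming distance to the rest of the population.

```python
def hamming(b1, b2):
    """Hamming distance between two equal-length binary behavior vectors
    (e.g. logged sensor/actuator states over an evaluation)."""
    return sum(x != y for x, y in zip(b1, b2))

def diversity_scores(behaviors):
    """Mean Hamming distance from each individual to the rest of the
    population (assumes at least two individuals)."""
    n = len(behaviors)
    return [sum(hamming(b, other) for other in behaviors) / (n - 1)
            for b in behaviors]

# A Pareto-based EA would then maximize (task_fitness, diversity) jointly.
```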
The Great Pyramid, or Khufu's Pyramid, was built on the Giza plateau in Egypt during the fourth dynasty by the pharaoh Khufu (Cheops), who reigned from 2509 BC to 2483 BC. Despite being one of the oldest and largest monuments on Earth, there is no consensus about how it was built. To understand its internal structure better, we imaged the pyramid using muons, which are by-products of cosmic rays that are only partially absorbed by stone. The resulting cosmic-ray muon radiography allows us to visualize the known and any unknown voids in the pyramid in a non-invasive way. Here we report the discovery of a large void (with a cross-section similar to that of the Grand Gallery and a minimum length of 30 metres) situated above the Grand Gallery. This constitutes the first major inner structure found in the Great Pyramid since the nineteenth century. The void, named ScanPyramids' Big Void, was first observed with nuclear emulsion films installed in the Queen's chamber, then confirmed with scintillator hodoscopes set up in the same chamber, and finally re-confirmed with gas detectors outside the pyramid. This large void has therefore been detected with high confidence by three different muon detection technologies and three independent analyses. These results constitute a breakthrough for the understanding of the internal structure of Khufu's Pyramid. Although there is currently no information about the intended purpose of this void, these findings show how modern particle physics can shed new light on the world's archaeological heritage.
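The underlying principle can be stated compactly. The relation below is the standard muon-radiography argument, not the collaboration's specific analysis: a void reduces the opacity along a line of sight, which lowers the energy a muon needs to get through and therefore raises the detected flux from that direction.

```latex
% Standard muography relation (illustrative). The opacity along a line of
% sight (\theta, \phi) is the integrated rock density:
\[
  X(\theta, \phi) = \int_{\text{path}} \rho(\ell)\, \mathrm{d}\ell,
\qquad
  \Phi_{\text{det}}(\theta, \phi) = \int_{E_{\min}(X)}^{\infty} \Phi_0(E, \theta)\, \mathrm{d}E,
\]
% where E_min(X) is the minimum energy a muon needs to cross opacity X and
% \Phi_0 is the open-sky muon flux. A void lowers X, hence lowers E_min and
% raises the detected flux from that direction.
```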
The reality gap, which often makes controllers evolved in simulation inefficient once transferred onto the physical robot, remains a critical issue in Evolutionary Robotics (ER). We hypothesize that this gap highlights a conflict between the efficiency of the solutions in simulation and their transferability from simulation to reality: the most efficient solutions in simulation often exploit badly modeled phenomena to achieve high fitness values with unrealistic behaviors. This hypothesis leads to the Transferability approach, a multi-objective formulation of ER in which two main objectives are optimized via a Pareto-based Multi-Objective Evolutionary Algorithm: (1) the fitness and (2) the transferability, estimated by a simulation-to-reality (STR) disparity measure. To evaluate this second objective, a surrogate model of the exact STR disparity is built during the optimization. This Transferability approach has been compared to two reality-based optimization methods: a noise-based approach inspired by Jakobi's minimal simulation methodology, and a local search approach. It has been validated on two robotic applications: (1) a navigation task with an e-puck robot; (2) a walking task with an 8-DOF quadrupedal robot. For both experimental set-ups, our approach successfully finds efficient and well-transferable controllers with only about ten experiments on the physical robot.
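A minimal sketch of the surrogate idea, under assumed names: each physical trial yields one exact simulation-to-reality (STR) disparity measurement, and a simple regressor (here a nearest-neighbour model, an assumption rather than the authors' choice) predicts the disparity of untried controllers so the MOEA can use it as a second objective.

```python
from sklearn.neighbors import KNeighborsRegressor  # assumed surrogate choice

class TransferabilitySurrogate:
    """Predicts the simulation-to-reality (STR) disparity of a controller
    from a handful of exact measurements made on the physical robot."""

    def __init__(self):
        self.model = KNeighborsRegressor(n_neighbors=1)
        self.descriptors, self.disparities = [], []

    def add_trial(self, descriptor, str_disparity):
        # One physical experiment yields one exact disparity value.
        self.descriptors.append(descriptor)
        self.disparities.append(str_disparity)
        self.model.fit(self.descriptors, self.disparities)

    def predict(self, descriptor):
        return self.model.predict([descriptor])[0]

# The MOEA then maximizes simulated fitness while minimizing the predicted
# disparity, occasionally running a new physical trial to refine the surrogate.
```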
Novelty search is a recent and promising approach to evolving neurocontrollers, especially for robot control. The main idea is to maximize the novelty of behaviors instead of their efficiency. However, abandoning the efficiency objective(s) may be too radical in many contexts. In this paper, a Pareto-based multi-objective evolutionary algorithm is employed to reconcile novelty search with objective-based optimization by following a multiobjectivization process. Several multiobjectivizations based on behavioral novelty and on behavioral diversity are compared on a maze navigation task. Results show that the bi-objective variant "Novelty + Fitness" is better at fine-tuning behaviors than basic novelty search, while requiring a comparable number of iterations to converge.
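A brief sketch of the novelty objective as commonly defined (illustrative, not the paper's exact code): an individual's novelty is its mean distance to its k nearest neighbours among current and archived behavior descriptors, and selection is then Pareto-based over (fitness, novelty).

```python
import math

def novelty(descriptor, others, k=15):
    """Mean distance to the k nearest behavior descriptors among the current
    population plus an archive of past behaviors (assumes `others` non-empty)."""
    dists = sorted(math.dist(descriptor, o) for o in others)
    nearest = dists[:k]
    return sum(nearest) / len(nearest)

# Each individual then carries the objective vector (task_fitness, novelty),
# and a Pareto-based MOEA such as NSGA-II selects on both simultaneously.
```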
Evolutionary robotics applies the selection, variation, and heredity principles of natural evolution to the design of robots with embodied intelligence. It can be considered as a subfield of robotics that aims to create more robust and adaptive robots. A pivotal feature of the evolutionary approach is that it considers the whole robot at once, and enables the exploitation of robot features in a holistic manner. Evolutionary robotics can also be seen as an innovative approach to the study of evolution based on a new kind of experimentalism. The use of robots as a substrate can help to address questions that are difficult, if not impossible, to investigate through computer simulations or biological studies. In this paper, we consider the main achievements of evolutionary robotics, focusing particularly on its contributions to both engineering and biology. We briefly elaborate on methodological issues, review some of the most interesting findings, and discuss important open issues and promising avenues for future work.
A long-standing goal in artificial intelligence is creating agents that can learn a variety of different skills for different problems. In the artificial intelligence subfield of neural networks, a barrier to that goal is that when agents learn a new skill they typically do so by losing previously acquired skills, a problem called catastrophic forgetting. This occurs because, to learn the new task, neural learning algorithms change connections that encode previously acquired skills. How networks are organized critically affects their learning dynamics. In this paper, we test whether catastrophic forgetting can be reduced by evolving modular neural networks. Modularity intuitively should reduce learning interference between tasks by separating functionality into physically distinct modules in which learning can be selectively turned on or off. Modularity can further improve learning by having a reinforcement learning module separate from sensory processing modules, allowing learning to happen only in response to a positive or negative reward. In this paper, learning takes place via neuromodulation, which allows agents to selectively change the rate of learning for each neural connection based on environmental stimuli (e.g. to alter learning in specific locations based on the task at hand). To produce modularity, we evolve neural networks with a cost for neural connections. We show that this connection cost technique causes modularity, confirming a previous result, and that such sparsely connected, modular networks have higher overall performance because they learn new skills faster while better retaining old skills, and because they have a separate reinforcement learning module. Our results suggest (1) that encouraging modularity in neural networks may help us overcome the long-standing barrier of networks that cannot learn new skills without forgetting old ones, and (2) that one benefit of the modularity ubiquitous in the brains of natural animals might be to alleviate the problem of catastrophic forgetting.
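A one-function sketch of the neuromodulated update described above (names are illustrative, not the paper's code): a modulatory signal gates a Hebbian weight change per connection, so learning can be switched on after a reward and off elsewhere, protecting previously acquired skills.

```python
def neuromodulated_update(weight, pre, post, modulation, eta=0.1):
    """Hebbian weight change gated by a modulatory signal.

    modulation ~ 0 freezes the connection, protecting previously learned
    skills; modulation > 0 (e.g. after a reward) enables learning there.
    """
    return weight + eta * modulation * pre * post
```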