2022
DOI: 10.48550/arxiv.2202.01258
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Accelerated Quality-Diversity for Robotics through Massive Parallelism

Abstract: Quality-Diversity (QD) algorithms are a well-known approach to generate large collections of diverse and high-quality policies. However, QD algorithms are also known to be data-inefficient, requiring large amounts of computational resources and are slow when used in practice for robotics tasks. Policy evaluations are already commonly performed in parallel to speed up QD algorithms but have limited capabilities on a single machine as most physics simulators run on CPUs. With recent advances in simulators that r… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
13
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(13 citation statements)
references
References 40 publications
0
13
0
Order By: Relevance
“…Since CMA-MAE is a general purpose algorithm, we are excited about future work that will test the CMA-MAE variants in domains beyond robot locomotion, such as robotic manipulation [25] and scenario generation [26]. Future work will also test the pretrained controllers in the real world and will explore the computational benefits of recent hardwareaccelerated frameworks [56].…”
Section: Discussionmentioning
confidence: 99%
“…Since CMA-MAE is a general purpose algorithm, we are excited about future work that will test the CMA-MAE variants in domains beyond robot locomotion, such as robotic manipulation [25] and scenario generation [26]. Future work will also test the pretrained controllers in the real world and will explore the computational benefits of recent hardwareaccelerated frameworks [56].…”
Section: Discussionmentioning
confidence: 99%
“…Recent advances in hardware acceleration have led to new QD libraries such as QDax [20] or EvoJax [21]. These tools rely on highly-parallelised simulators like Brax [22] that can run on accelerators (e.g., GPUs and TPUs) and thus target simulated domains, for example, robotics control, where they drastically reduce the evaluation time.…”
Section: B Hardware-accelerated Quality-diversitymentioning
confidence: 99%
“…In addition, they have given us access to 10 or 100 times more evaluations per generation within the same amount of time. Lim et al [20] prove that the performance of MAP-Elites is robust to large increases in batch-size values (i.e. large increases in the number of solutions generated per generation).…”
Section: B Hardware-accelerated Quality-diversitymentioning
confidence: 99%
See 2 more Smart Citations