2019
DOI: 10.48550/arxiv.1911.06832
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Data-efficient Co-Adaptation of Morphology and Behaviour with Deep Reinforcement Learning

Abstract: Humans and animals are capable of quickly learning new behaviours to solve new tasks. Yet, we often forget that they also rely on a highly specialized morphology that co-adapted with motor control throughout thousands of years. Although compelling, the idea of co-adapting morphology and behaviours in robots is often unfeasible because of the long manufacturing times, and the need to redesign an appropriate controller for each morphology. In this paper, we propose a novel approach to automatically and efficient… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(2 citation statements)
references
References 19 publications
0
2
0
Order By: Relevance
“…Similar to our approach, several reinforcement learning-based strategies to co-optimization exist for both continuous [1,32,33,34] and discrete design spaces [10,35,14]. Luck et al [33] use a soft actor-critic algorithm and use a design-conditioned Q-function to evaluate designs.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Similar to our approach, several reinforcement learning-based strategies to co-optimization exist for both continuous [1,32,33,34] and discrete design spaces [10,35,14]. Luck et al [33] use a soft actor-critic algorithm and use a design-conditioned Q-function to evaluate designs.…”
Section: Related Workmentioning
confidence: 99%
“…Similar to our approach, several reinforcement learning-based strategies to co-optimization exist for both continuous [1,32,33,34] and discrete design spaces [10,35,14]. Luck et al [33] use a soft actor-critic algorithm and use a design-conditioned Q-function to evaluate designs. Chen et al [34] model the design space as a differentiable computational graph, which allows them to use standard gradient-based methods.…”
Section: Related Workmentioning
confidence: 99%