2017 IEEE International Conference on Robotics and Automation (ICRA) 2017
DOI: 10.1109/icra.2017.7989250
|View full text |Cite
|
Sign up to set email alerts
|

Learning modular neural network policies for multi-task and multi-robot transfer

Abstract: Reinforcement learning (RL) can automate a wide variety of robotic skills, but learning each new skill requires considerable real-world data collection and manual representation engineering to design policy classes or features. Using deep reinforcement learning to train general purpose neural network policies alleviates some of the burden of manual representation engineering by using expressive policy classes, but exacerbates the challenge of data collection, since such methods tend to be less efficient than R… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

2
177
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
5
3
2

Relationship

0
10

Authors

Journals

citations
Cited by 244 publications
(179 citation statements)
references
References 20 publications
2
177
0
Order By: Relevance
“…In addition, the auxiliary tasks in table 1-such as the sun position, the sky image, the sun position variation and the sky intensity variation-are independent from the solar panel. Evidence from the literature (Devin et al, 2017), which demonstrates that networks trained on multiple tasks have can better generalize on other datasets, leads us to believe that the "LSTM-Full" model may have a better chance of adapting to another site, for example with the same fine-tuning strategy mentioned above.…”
Section: Discussion Of Limitationsmentioning
confidence: 99%
“…In addition, the auxiliary tasks in table 1-such as the sun position, the sky image, the sun position variation and the sky intensity variation-are independent from the solar panel. Evidence from the literature (Devin et al, 2017), which demonstrates that networks trained on multiple tasks have can better generalize on other datasets, leads us to believe that the "LSTM-Full" model may have a better chance of adapting to another site, for example with the same fine-tuning strategy mentioned above.…”
Section: Discussion Of Limitationsmentioning
confidence: 99%
“…To alleviate some of these problems while retaining the advantages of data-driven methods, a number of works propose to structure the navigation system into two modules: perception and control [28]- [32]. This kind of modularity has proven to be particularly important for transferring sensorimotor systems across different tasks [29], [31] and application domains [30], [32].…”
Section: A Data-driven Algorithms For Autonomous Navigationmentioning
confidence: 99%
“…This is in contrast to previous works using unstructured task representations and policies [8,11]. The use of compositionality has led to better generalization in Visual Question Answering [17,20,24] and Policy Learning [3,7,38]. We propose Neural Task Graph (NTG) Networks, a novel framework that uses task graph as the intermediate representation to explicitly modularize both the visual demonstration and the derived policy.…”
Section: Conjugate Task Graphmentioning
confidence: 97%