DOI: 10.1007/978-3-540-87481-2_32
|View full text |Cite
|
Sign up to set email alerts
|

Transferring Instances for Model-Based Reinforcement Learning

Abstract: Reinforcement learning agents typically require a significant amount of data before performing well on complex tasks. Transfer learning methods have made progress reducing sample complexity, but they have primarily been applied to model-free learning methods, not more data-efficient model-based learning methods. This paper introduces timbrel, a novel method capable of transferring information effectively into a model-based reinforcement learning algorithm. We demonstrate that timbrel can significantly improve … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
83
0
1

Publication Types

Select...
7
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 99 publications
(85 citation statements)
references
References 15 publications
0
83
0
1
Order By: Relevance
“…The type of knowledge that can be transferred between tasks varies among different TL methods, including value functions [8], entire policies [9], actions (policy advice) [10], or a set of samples from a source task that can be used by a model-based RL algorithm in a target task [11].…”
Section: Transfer Learning and Advising Under A Budgetmentioning
confidence: 99%
“…The type of knowledge that can be transferred between tasks varies among different TL methods, including value functions [8], entire policies [9], actions (policy advice) [10], or a set of samples from a source task that can be used by a model-based RL algorithm in a target task [11].…”
Section: Transfer Learning and Advising Under A Budgetmentioning
confidence: 99%
“…In order to cluster tasks into different classes, Lazaric and Ghavamzadeh (2010) place Fig. 6 Transfer from 2D to 3D mountain car (Taylor et al, 2008a). …”
Section: Parameter Transfermentioning
confidence: 99%
“…The level of knowledge that can be transferred across tasks can be low, such as tuples of the form s, a, r, s ′ [6,10], value-functions [12] or policies [2]. Higher level knowledge may include rules [7,13], action subsets or shaping rewards [5].…”
Section: Transfer Learning In Rlmentioning
confidence: 99%