“…This has a very practical benefit, as it limits the length of the applicable interaction history. Current work also typically assumes a stationary task-state distribution for meta-learning (Doshi-Velez and Konidaris, 2013; Wang et al., 2016; Zintgraf et al., 2018; Rakelly et al., 2019; Zintgraf et al., 2019; Humplik et al., 2019; Fakoor et al., 2019; Perez et al., 2020). However, this framework has also been readily applied to more challenging multi-agent learning settings (Da Silva et al., 2006; Amato et al., 2013; Marinescu et al., 2017; Vezhnevets et al., 2019).…”