2018
DOI: 10.48550/arxiv.1804.03720
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Gotta Learn Fast: A New Benchmark for Generalization in RL

Abstract: In this report, we present a new reinforcement learning (RL) benchmark based on the Sonic the Hedgehog TM video game franchise. This benchmark is intended to measure the performance of transfer learning and few-shot learning algorithms in the RL domain. We also present and evaluate some baseline algorithms on the new benchmark.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

1
63
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 54 publications
(64 citation statements)
references
References 16 publications
1
63
0
Order By: Relevance
“…Exploration for Procedurally-Generated Environments. Several recent studies have discussed the generalization of reinforcement learning (Rajeswaran et al, 2017;Zhang et al, 2018a;b;Choi et al, 2018) and designed procedurally-generated environments to test the generalization of reinforcement learning (Beattie et al, 2016;Nichol et al, 2018;. More recent papers show that traditional exploration methods fall short in procedurally-generated environments and address this issue with new exploration methods Campero et al, 2020).…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Exploration for Procedurally-Generated Environments. Several recent studies have discussed the generalization of reinforcement learning (Rajeswaran et al, 2017;Zhang et al, 2018a;b;Choi et al, 2018) and designed procedurally-generated environments to test the generalization of reinforcement learning (Beattie et al, 2016;Nichol et al, 2018;. More recent papers show that traditional exploration methods fall short in procedurally-generated environments and address this issue with new exploration methods Campero et al, 2020).…”
Section: Related Workmentioning
confidence: 99%
“…are designed to test the generalization of RL, such as (Beattie et al, 2016;Nichol et al, 2018;Côté et al, 2018;Cobbe et al, 2019;, in which the agent aims to solve the same task, but a different environment is generated in each episode.…”
Section: Introductionmentioning
confidence: 99%
“…Related to our approach is a strand of literature that assumes there exists a distribution of Markov-decision-problems of the scenario of interest, and then trains algorithms on a finite set of samples from this distribution before testing the behavior on the entire distribution (e.g. Zhang et al, 2018a;Nichol et al, 2018;Justesen et al, 2018).…”
Section: Introductionmentioning
confidence: 99%
“…Model-free RL, like model-based RL, has also suffered from both the "train=test" paradigm and a lack of standardization around how to measure generalization. In response, recent papers have discussed what generalization in RL means and how to measure it [7,8,36,49,71], and others have proposed new environments such as Procgen [9] and Meta-World [74] as benchmarks focusing on measuring generalization. While popular in the model-free community [e.g.…”
Section: Introductionmentioning
confidence: 99%