2021
DOI: 10.48550/arxiv.2110.02102
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

CARL: A Benchmark for Contextual and Adaptive Reinforcement Learning

Abstract: While Reinforcement Learning has made great strides towards solving ever more complicated tasks, many algorithms are still brittle to even slight changes in their environment. This is a limiting factor for real-world applications of RL. Although the research community continuously aims at improving both robustness and generalization of RL algorithms, unfortunately it still lacks an open-source set of well-defined benchmark problems based on a consistent theoretical framework, which allows comparing different a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 17 publications
0
2
0
Order By: Relevance
“…Furthermore, we present additional experiments in Appendix suggesting that our model can potentially generalize to non-Markovian and state-dependent settings. While we presented several experiments in various environments, further experimental evaluation is required, e.g., using Benjamins et al (2021).…”
Section: Conclusion and Discussionmentioning
confidence: 99%
“…Furthermore, we present additional experiments in Appendix suggesting that our model can potentially generalize to non-Markovian and state-dependent settings. While we presented several experiments in various environments, further experimental evaluation is required, e.g., using Benjamins et al (2021).…”
Section: Conclusion and Discussionmentioning
confidence: 99%
“…In the literature, several methods exist that include explicit contextual information. For example, Benjamins et al [41], [42] introduced a framework designed to solve CMDPs and a benchmark library. The framework includes information such as gravity, target distance, actuator strength, and joint stiffness in the learning process.…”
Section: Explicit Context-based Methodsmentioning
confidence: 99%