Going beyond methods that only fine-tune a learned initialization using online interaction [40,25,31], we consider two independent fine-tuning settings: (1) a fully offline setting, in which we fine-tune the pre-trained policy without any online interaction, and (2) a setting in which a limited amount of online interaction is allowed, so that the agent can autonomously acquire the skills needed to solve the task from a challenging initial condition. This resembles the problem setting considered by offline meta-RL methods [33,8,39,45,34]. However, our approach is simpler: we fine-tune with the very same offline RL algorithm that we use for pre-training.
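To make the two settings concrete, the following is a minimal sketch of the control flow, assuming a generic offline RL learner; the `OfflineRLAgent` class, the environment interface, and all function names are hypothetical placeholders for illustration, not the paper's actual implementation. The key point it mirrors is that the same offline RL update rule is used for pre-training and for both fine-tuning settings.

```python
import random


class OfflineRLAgent:
    """Hypothetical stand-in for an offline RL algorithm; the same
    object (and update rule) is reused for pre-training and fine-tuning."""

    def update(self, batch):
        pass  # one gradient step on a batch of offline transitions

    def act(self, obs):
        return 0  # placeholder action selection


def pretrain(agent, multi_task_data, steps):
    # Pre-training: offline RL on a large multi-task dataset.
    for _ in range(steps):
        agent.update(random.choice(multi_task_data))


def finetune_offline(agent, target_task_data, steps):
    # Setting (1): fully offline fine-tuning on a small target-task
    # dataset, with no environment interaction at all.
    for _ in range(steps):
        agent.update(random.choice(target_task_data))


def finetune_online(agent, env, buffer, episodes, steps_per_episode):
    # Setting (2): a limited budget of autonomous online interaction.
    # Collected transitions extend the buffer, and the same offline RL
    # update is applied throughout.
    for _ in range(episodes):
        obs, done = env.reset(), False
        while not done:
            action = agent.act(obs)
            next_obs, reward, done = env.step(action)
            buffer.append((obs, action, reward, next_obs, done))
            obs = next_obs
        for _ in range(steps_per_episode):
            agent.update(random.sample(buffer, k=min(256, len(buffer))))
```

In this sketch, no algorithmic component changes between phases; only the data source differs (multi-task offline data, target-task offline data, or a small amount of autonomously collected experience), which is what distinguishes this approach from offline meta-RL methods that introduce a separate adaptation procedure.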