2021
DOI: 10.48550/arxiv.2110.02639
Preprint

On The Transferability of Deep-Q Networks

Abstract: Transfer Learning (TL) is an efficient machine learning paradigm that allows overcoming some of the hurdles that characterize the successful training of deep neural networks, ranging from long training times to the need for large datasets. While exploiting TL is a well established and successful training practice in Supervised Learning (SL), its applicability in Deep Reinforcement Learning (DRL) is rarer. In this paper, we study the level of transferability of three different variants of Deep-Q Networks on popu…

Cited by 1 publication (2 citation statements)
References: 26 publications (29 reference statements)

“…Actor-critic methods [69,94] are hybrid approaches that use policy-based methods to improve a policy while also evaluating it by estimating its corresponding value function. Several studies, including [22,95], investigated the adaptability of value-based algorithms to environmental changes. They trained a value function on a source task and then transferred the value function's parameters to a new task that differed from the source task in transition dynamics.…”
Section: Related Work
confidence: 99%
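
The parameter-transfer scheme described in the citation statement above can be made concrete with a minimal PyTorch sketch. This is a hedged illustration only, not the cited authors' code: the network architecture, dimensions, and the names QNetwork, source_q, and target_q are hypothetical, and it assumes the source and target tasks share observation and action spaces.

```python
# Minimal sketch of value-function parameter transfer between tasks.
# QNetwork and the dimensions below are hypothetical placeholders;
# both tasks are assumed to share observation/action spaces.
import copy
import torch
import torch.nn as nn

class QNetwork(nn.Module):
    """Small MLP Q-network mapping a state to per-action values."""
    def __init__(self, obs_dim: int, n_actions: int):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(obs_dim, 64), nn.ReLU(),
            nn.Linear(64, 64), nn.ReLU(),
            nn.Linear(64, n_actions),
        )

    def forward(self, obs: torch.Tensor) -> torch.Tensor:
        return self.net(obs)

# 1) Train (or load) a Q-network on the source task.
source_q = QNetwork(obs_dim=4, n_actions=2)
# ... a standard DQN training loop on the source task would go here ...

# 2) Transfer: initialize the target-task network from the source weights.
target_q = QNetwork(obs_dim=4, n_actions=2)
target_q.load_state_dict(copy.deepcopy(source_q.state_dict()))

# 3) Fine-tune target_q on the target task, whose transition dynamics
#    differ from the source task's; only the initialization is shared.
```

In this scheme the target agent starts from the source task's value estimates rather than from random weights; whether that helps or hurts depends on how far apart the two tasks' dynamics are, which is exactly the transferability question the paper studies.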
See 1 more Smart Citation
“…Actor-critic methods [69,94] are hybrid approaches that use policy-based methods to improve a policy while also evaluating it by estimating its corresponding value function. Several studies, including [22,95], investigated the adaptability of value-based algorithms to environmental changes. They trained a value function on a source task and then transferred the value function's parameters to a new task that differed from the source task in transition dynamics.…”
Section: Related Workmentioning
confidence: 99%
“…MF methods enable quick and computationally efficient action selection at decision time. However, as recent research [22,23,24,25] has shown, the adaptability of MF frameworks to environmental changes does not appear to be promising. This is due to the fact that an MF agent cannot adapt cached values of all states to changes in the environment.…”
Section: Introduction
confidence: 99%
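
The "cached values" limitation mentioned above can be illustrated with a small hypothetical tabular example (not taken from the cited papers): a model-free agent stores one value per state-action pair, so when the transition dynamics change, only the pairs it actually re-visits get corrected, while every other cached value still reflects the old environment.

```python
# Hypothetical illustration of why a model-free agent adapts slowly:
# it caches one value per (state, action), and a change in dynamics
# only propagates into values that are re-experienced and backed up.
from collections import defaultdict

q = defaultdict(float)   # cached action values, default 0.0
alpha, gamma = 0.1, 0.99

def td_update(s, a, r, s_next, actions):
    """Standard one-step Q-learning backup for a single experienced pair."""
    best_next = max(q[(s_next, b)] for b in actions)
    q[(s, a)] += alpha * (r + gamma * best_next - q[(s, a)])

# Suppose the environment's dynamics change after some training.
# Only the pair below is re-visited and corrected:
td_update("s0", "left", 1.0, "s1", ["left", "right"])
# q[(s, a)] for all other pairs still encodes the old dynamics --
# the adaptability limitation described in the statement above.
```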