2022
DOI: 10.1007/978-3-031-15008-1_1
|View full text |Cite
|
Sign up to set email alerts
|

Reinforcement Learning with Guarantees that Hold for Ever

Abstract: Reinforcement learning is a successful explore-and-exploit approach, where a controller tries to learn how to navigate an unknown environment. The principle approach is for an intelligent agent to learn how to maximise expected rewards. But what happens if the objective refers to non-terminating systems? We can obviously not wait until an infinite amount of time has passed, assess the success, and update. But what can we do? This talk will tell.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 17 publications
(23 reference statements)
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?