2024
DOI: 10.1609/aaai.v38i19.30148
|View full text |Cite
|
Sign up to set email alerts
|

A PAC Learning Algorithm for LTL and Omega-Regular Objectives in MDPs

Mateo Perez,
Fabio Somenzi,
Ashutosh Trivedi

Abstract: Linear temporal logic (LTL) and omega-regular objectives---a superset of LTL---have seen recent use as a way to express non-Markovian objectives in reinforcement learning. We introduce a model-based probably approximately correct (PAC) learning algorithm for omega-regular objectives in Markov decision processes (MDPs). As part of the development of our algorithm, we introduce the epsilon-recurrence time: a measure of the speed at which a policy converges to the satisfaction of the omega-regular objective in th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
references
References 21 publications
0
0
0
Order By: Relevance