2021
DOI: 10.1007/s00165-021-00536-5
|View full text |Cite
|
Sign up to set email alerts
|

L∗-based learning of Markov decision processes (extended version)

Abstract: Automata learning techniques automatically generate systemmodels fromtest observations. Typically, these techniques fall into two categories: passive and active. On the one hand, passive learning assumes no interaction with the system under learning and uses a predetermined training set, e.g., system logs. On the other hand, active learning techniques collect training data by actively querying the system under learning, allowing one to steer the discovery ofmeaningful information about the systemunder learning… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

1
15
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 14 publications
(18 citation statements)
references
References 61 publications
1
15
0
Order By: Relevance
“…The following result follows from the convergence of L for label-deterministic MDPs, shown in (Tappler et al 2021). In that work, the authors show that under uniformly randomized testing strategies, the sampling-based L algorithm converges almost surely in the limit to the MDP under learning.…”
Section: Correctness Convergence and Compactnessmentioning
confidence: 92%
See 4 more Smart Citations
“…The following result follows from the convergence of L for label-deterministic MDPs, shown in (Tappler et al 2021). In that work, the authors show that under uniformly randomized testing strategies, the sampling-based L algorithm converges almost surely in the limit to the MDP under learning.…”
Section: Correctness Convergence and Compactnessmentioning
confidence: 92%
“…The L algorithm for learning regular languages (Angluin 1987) is the quintessential example of active inference, and assumes the existence of a minimally adequate teacher capable of answering membership and equivalence queries. This method has been broadly adopted and generalized to learn interface automata (Aarts and Vaan-drager 2010), Mealy machines (Niese 2003), automaton representations of recurrent neural networks Yahav 2018, 2019), and MDPs (Tappler et al 2021).…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations