2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA) 2021
DOI: 10.1109/icmla52953.2021.00195
|View full text |Cite
|
Sign up to set email alerts
|

Active Learning of Markov Decision Processes using Baum-Welch algorithm

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
0
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 7 publications
(2 citation statements)
references
References 19 publications
0
0
0
Order By: Relevance
“…Value Iteration (VI) was originally introduced to approximate expected rewards in MDPs [7]. In a broader sense, VI simply refers to iterating a function f : R |S| → R |S| (called Bellman operator in the MDP setting) from some given initial vector x (0) , i.e., to compute the sequence x (1) = f (x (0) ), x (2) = f (x (1) ), etc. Instances of VI are usually set up such that the sequence converges to a (generally non-unique) fixed point x = f (x).…”
Section: Value Iterationmentioning
confidence: 99%
See 1 more Smart Citation
“…Value Iteration (VI) was originally introduced to approximate expected rewards in MDPs [7]. In a broader sense, VI simply refers to iterating a function f : R |S| → R |S| (called Bellman operator in the MDP setting) from some given initial vector x (0) , i.e., to compute the sequence x (1) = f (x (0) ), x (2) = f (x (1) ), etc. Instances of VI are usually set up such that the sequence converges to a (generally non-unique) fixed point x = f (x).…”
Section: Value Iterationmentioning
confidence: 99%
“…EVTs are also employed in an algorithm proposed in [8] for LTL model checking of interval Markov chains. Moreover, EVTs have been leveraged for minimizing and learning DTMCs [1,2]. Further recent applications of EVTs to MDPs include verifying cause-effect dependencies [3], as well as an abstraction-refinement procedure that measures the importance of states based on the EVTs under a fixed policy [33].…”
Section: Related Workmentioning
confidence: 99%