2021
DOI: 10.1609/aaai.v35i13.17401
|View full text |Cite
|
Sign up to set email alerts
|

Robust Finite-State Controllers for Uncertain POMDPs

Abstract: Uncertain partially observable Markov decision processes (uPOMDPs) allow the probabilistic transition and observation functions of standard POMDPs to belong to a so-called uncertainty set. Such uncertainty, referred to as epistemic uncertainty, captures uncountable sets of probability distributions caused by, for instance, a lack of data available. We develop an algorithm to compute finite-memory policies for uPOMDPs that robustly satisfy specifications against any admissible distribution. In general, computin… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
15
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
4
1
1

Relationship

3
3

Authors

Journals

citations
Cited by 16 publications
(15 citation statements)
references
References 27 publications
0
15
0
Order By: Relevance
“…A policy represented by an FSC can get arbitrarily close to the optimal policy as κ grows (Bonet 2002). Computing finite-state controllers that aim to maximize the expected (discounted) reward can be done in several ways, such as gradient descent (Meuleau et al 1999b) and convex optimization (Amato, Bernstein, and Zilberstein 2010;Junges et al 2018;Cubuktepe et al 2021).…”
Section: Mdps Pomdps and Fscsmentioning
confidence: 99%
“…A policy represented by an FSC can get arbitrarily close to the optimal policy as κ grows (Bonet 2002). Computing finite-state controllers that aim to maximize the expected (discounted) reward can be done in several ways, such as gradient descent (Meuleau et al 1999b) and convex optimization (Amato, Bernstein, and Zilberstein 2010;Junges et al 2018;Cubuktepe et al 2021).…”
Section: Mdps Pomdps and Fscsmentioning
confidence: 99%
“…For a detailed handling of different types of uncertainty sets, we refer to [83]. Extensions to uncertain POMDPs also exist [31,79]. Distributions over Parameters.…”
Section: Epiloguementioning
confidence: 99%
“…The methods presented in [13,22] exploit a repetitive structure in parametric MCs to accelerate the construction of closed form solutions and are not applicable to MDPs. Parametric models have been used to support the design of systems [2,8] or their adaption [6,9], to find policies for partially observable systems [11], to analyse Bayesian networks [34], and to speed up the analysis of, e.g., software product lines [10,37]. On top of technical differences, none of these approaches uses a hierarchical decomposition of an MDP or uses the results of the analysis in the analysis of a larger MDP.…”
Section: Related Workmentioning
confidence: 99%