Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence 2020
DOI: 10.24963/ijcai.2020/569
|View full text |Cite
|
Sign up to set email alerts
|

Robust Policy Synthesis for Uncertain POMDPs via Convex Optimization

Abstract: We study the problem of policy synthesis for uncertain partially observable Markov decision processes (uPOMDPs). The transition probability function of uPOMDPs is only known to belong to a so-called uncertainty set, for instance in the form of probability intervals. Such a model arises when, for example, an agent operates under information limitation due to imperfect knowledge about the accuracy of its sensors. The goal is to compute a policy for the agent that is robust against all possible probabi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
18
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
3
3
2

Relationship

5
3

Authors

Journals

citations
Cited by 13 publications
(18 citation statements)
references
References 5 publications
0
18
0
Order By: Relevance
“…Notably, we introduce a novel robust spacecraft motion planning scenario. We show that our method scales to significantly larger models than (Suilen et al 2020). This scalability advantage allows more precise models and adding memory to the policies.…”
Section: Contribution and Approachmentioning
confidence: 86%
See 2 more Smart Citations
“…Notably, we introduce a novel robust spacecraft motion planning scenario. We show that our method scales to significantly larger models than (Suilen et al 2020). This scalability advantage allows more precise models and adding memory to the policies.…”
Section: Contribution and Approachmentioning
confidence: 86%
“…Combining this dualization with a linear-time transformation of the uPOMDP to a so-called simple uPOMDP (Junges et al 2018) ensures exact solutions to the original problem. The exact solutions and the moderate increase in the problem size contrasts with an over-approximative solution computed using an exponentially larger encoding proposed in (Suilen et al 2020). Finite nonconvex problem.…”
Section: Contribution and Approachmentioning
confidence: 96%
See 1 more Smart Citation
“…Ahmadi et al (2020) use control barrier functions to compute safe reachable sets in the belief space of POMDPs. Extensions to epistemic or uncertain POMDPs compute FSCs using convex optimization (Cubuktepe et al, 2021;Suilen et al, 2020).…”
Section: Related Workmentioning
confidence: 99%
“…As an extension of the first result, in [4], we studied the problem of policy synthesis for uncertain POMDPs. The transition probability function of uncertain POMDPs is only known to belong to a so-called uncertainty set, for instance in the form of probability intervals.…”
Section: Ccs Conceptsmentioning
confidence: 99%