2012
DOI: 10.1007/s10458-012-9200-2
|View full text |Cite
|
Sign up to set email alerts
|

A survey of point-based POMDP solvers

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
290
0

Year Published

2013
2013
2020
2020

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 360 publications
(300 citation statements)
references
References 22 publications
0
290
0
Order By: Relevance
“…For the purpose of this work, we are using MCVI as a method to solve factored MDPs and demonstrate our technique for refinement on a large problem. We leave the question of determining an optimal setOf States for MCVI as future work, though we note that this question has been extensively studied in point-based value iteration algorithms for POMDPs [14].…”
Section: Experimental Results On Diagnostic Problemsmentioning
confidence: 99%
“…For the purpose of this work, we are using MCVI as a method to solve factored MDPs and demonstrate our technique for refinement on a large problem. We leave the question of determining an optimal setOf States for MCVI as future work, though we note that this question has been extensively studied in point-based value iteration algorithms for POMDPs [14].…”
Section: Experimental Results On Diagnostic Problemsmentioning
confidence: 99%
“…Like MDPs [5] and POMDPs [8], [10], dynamic programming methods have been used in the context of DecPOMDPs [38]. Here, a set of T -step policy trees, one for each agent, is generated from the bottom up.…”
Section: A Optimal Approachesmentioning
confidence: 99%
“…Since then, several solution strategies that focus on the efficiency and feasibility of obtaining a solution have been explored for POMDPs in the AI community [8]- [10]. also been tackled in the control systems literature.…”
Section: Introductionmentioning
confidence: 99%
“…Most research has focused on determining the best set of http://www.jrobio.com/content/1/1/8 belief points [6][7][8] to be evaluated in VI. These methods rely on exploratory/search heuristics to discover a sufficient set of probability densities or sample points to be able to construct a sufficiently accurate approximation of the belief space such that an optimal policy can be found (see [9] for a detailed review on PBVI algorithms).…”
Section: Acting Under Partial Observabilitymentioning
confidence: 99%