2013
DOI: 10.1007/978-3-642-40988-2_13
|View full text |Cite
|
Sign up to set email alerts
|

Continuous Upper Confidence Trees with Polynomial Exploration – Consistency

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
56
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 35 publications
(56 citation statements)
references
References 12 publications
0
56
0
Order By: Relevance
“…Given the importance of establishing confidence for an autonomy system in the environmental and earth sciences, the key challenge to overcome is related to addressing tree degeneracy in continuous, partially-observable domains and formulating performance guarantees of the framework. This thesis extends planners for fully-observable continuous-domains [21,22] to partially-observable domains, and demonstrates that theoretical performance guarantees are preserved.…”
Section: Decision-making and Planning In Continuous Domainsmentioning
confidence: 90%
See 1 more Smart Citation
“…Given the importance of establishing confidence for an autonomy system in the environmental and earth sciences, the key challenge to overcome is related to addressing tree degeneracy in continuous, partially-observable domains and formulating performance guarantees of the framework. This thesis extends planners for fully-observable continuous-domains [21,22] to partially-observable domains, and demonstrates that theoretical performance guarantees are preserved.…”
Section: Decision-making and Planning In Continuous Domainsmentioning
confidence: 90%
“…However, this assumption can compromise search and has optimality guarantees only in linear-Gaussian systems [192]. Instead, PLUMES uses Monte Carlo Tree Search (MCTS) with progressive widening, referred to as continuous-observation MCTS, to limit the growth of the planning tree [22] and retain optimality [21] in continuous environments.…”
Section: Significance and Role Of Transiencementioning
confidence: 99%
“…Theoretically, PLUMES can be shown to select asymptotically optimal actions. We briefly describe how analysis in Auger et al [17] for PUCT-MCTS with progressive widening in MDPs can be extended to PLUMES. * Refer to Table 1 of Auger et al [17] for parameter settings.…”
Section: B Planning With Continuous-observation Mctsmentioning
confidence: 99%
“…Subsequently, Theorem 1 in Auger et al [17] shows that for an MDP with a continuous state space, like the belief-state MDP representation suggested, the value function estimated by continuous-observation MCTS asymptotically converges to that of the optimal policy:…”
Section: B Planning With Continuous-observation Mctsmentioning
confidence: 99%
See 1 more Smart Citation