2018
DOI: 10.1137/16m1084924
|View full text |Cite
|
Sign up to set email alerts
|

Experimental Design for Partially Observed Markov Decision Processes

Abstract: sented. We discuss how parameter dependence within these methods can be dealt with by the use of priors, and develop tools to update control policies online. This is demonstrated in another stochastic dynamical system describing growth dynamics of DNA template in a PCR model.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
10
0

Year Published

2019
2019
2021
2021

Publication Types

Select...
4

Relationship

2
2

Authors

Journals

citations
Cited by 4 publications
(10 citation statements)
references
References 11 publications
0
10
0
Order By: Relevance
“…The second term, E y|(t,x) M i−1 (t, y), is the maximal information expected thereafter. Consequently we say that M i is the maximal Fisher information to go (FITG) for i observations, a phrase adopted from [1] and [6]. 1 Without loss of generality, we assume that, for all s and i, the supremum in (3) is achieved on the interval (s, τ ).…”
Section: Proposed Designmentioning
confidence: 99%
See 4 more Smart Citations
“…The second term, E y|(t,x) M i−1 (t, y), is the maximal information expected thereafter. Consequently we say that M i is the maximal Fisher information to go (FITG) for i observations, a phrase adopted from [1] and [6]. 1 Without loss of generality, we assume that, for all s and i, the supremum in (3) is achieved on the interval (s, τ ).…”
Section: Proposed Designmentioning
confidence: 99%
“…Discretizing Time: Often the δ needed to make the coefficients in (6) positive is so small that our policy scarcely changes across time increments of that size. For greater computational efficiency, we discretize time with the mesh…”
Section: Additional Detailsmentioning
confidence: 99%
See 3 more Smart Citations