2005
DOI: 10.21236/ada438510
|View full text |Cite
|
Sign up to set email alerts
|

Convergence of Sample Path Optimal Policies for Stochastic Dynamic Programming

Abstract: ISR develops, applies and teaches advanced methodologies of design and analysis to AbstractWe consider the solution of stochastic dynamic programs using sample path estimates. Applying the theory of large deviations, we derive probability error bounds associated with the convergence of the estimated optimal policy to the true optimal policy, for finite horizon problems. These bounds decay at an exponential rate, in contrast with the usual canonical (inverse) square root rate associated with estimation of the v… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 13 publications
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?