X. G. Wu scite author profile

This paper deals with the first passage optimality and variance minimisation problems of discrete-time Markov decision processes (MDPs) with varying discount factors and unbounded rewards/costs. First, under suitable conditions slightly weaker than those in the previous literature on the standard (infinite horizon) discounted MDPs, we establish the existence and characterisation of the first passage expected-optimal stationary policies. Second, to further distinguish the expected-optimal stationary policies, we introduce the variance minimisation problem, prove that it is equivalent to a new first passage optimality problem of MDPs, and, thus, show the existence of a variance-optimal policy that minimises the variance over the set of all first passage expected-optimal stationary policies. Finally, we use a computable example to illustrate our main results and also to show the difference between the first passage optimality here and the standard discount optimality of MDPs in the previous literature.

show abstract

Level structure in the transitional nucleus Tl199

Zheng

et al. 2018

Phys. Rev. C

View full text Add to dashboard Cite

High-spin states inGd152

Wang¹,

Hua²,

Meng³

et al. 2005

Phys. Rev. C

View full text Add to dashboard Cite

Evolution of octupole correlations inBa123

Chen¹,

Zhao²,

Xu³

et al. 2016

Phys. Rev. C

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

334 Leonard St

Brooklyn, NY 11211

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

X. G. Wu

Lifetime measurements inPt180

Nonaxial shapes in the odd-odd194Au nucleus

Band structures in106Pd

Observation of a secondπh11/2⊗νh11/2band in
Y.
¹
,
Lu
²
,
Ruan
³

et al. 2013
Phys. Rev. C
14
2
28
1
View full text Add to dashboard Cite

First Passage Optimality and Variance Minimisation of Markov Decision Processes with Varying Discount Factors

Level structure in the transitional nucleus Tl199

High-spin states inGd152

Evolution of octupole correlations inBa123

Contact Info

Product

Resources

About

X. G. Wu

Lifetime measurements inPt180

Nonaxial shapes in the odd-odd194Au nucleus

Band structures in106Pd

Observation of a secondπh11/2⊗νh11/2band inY.1, Lu2, Ruan3 et al. 2013Phys. Rev. C142281View full textAdd to dashboardCite

First Passage Optimality and Variance Minimisation of Markov Decision Processes with Varying Discount Factors

Level structure in the transitional nucleus Tl199

High-spin states inGd152

Evolution of octupole correlations inBa123

Contact Info

Product

Resources

About

Observation of a secondπh11/2⊗νh11/2band in
Y.
¹
,
Lu
²
,
Ruan
³

et al. 2013
Phys. Rev. C
14
2
28
1
View full text Add to dashboard Cite