2012
DOI: 10.1287/moor.1120.0555

Average Cost Markov Decision Processes with Weakly Continuous Transition Probabilities

Abstract: This paper presents sufficient conditions for the existence of stationary optimal policies for average cost Markov decision processes with Borel state and action sets and weakly continuous transition probabilities. The one-step cost functions may be unbounded, and the action sets may be noncompact. The main contributions of this paper are: (i) general sufficient conditions for the existence of stationary discount optimal and average cost optimal policies and descriptions of properties of value functions and se…
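
As a rough illustration of what a "stationary discount optimal policy" means, the sketch below runs value iteration on a toy finite MDP. It is not the paper's setting or method: the paper treats Borel state and action sets with possibly unbounded costs, whereas this example assumes two states, two actions, and bounded costs. The function name `value_iteration`, the arrays `P` and `c`, and the discount factor `alpha` are all hypothetical names chosen for this sketch.

```python
def value_iteration(P, c, alpha=0.9, tol=1e-10, max_iter=10_000):
    """Discounted-cost value iteration on a finite MDP (illustrative only).

    P[a][x][y] is the probability of moving from state x to state y under
    action a; c[x][a] is the one-step cost. Both layouts are assumptions
    made for this sketch.
    """
    n_states = len(c)
    n_actions = len(c[0])

    def q(x, a, v):
        # One-step cost plus discounted expected cost-to-go.
        return c[x][a] + alpha * sum(P[a][x][y] * v[y] for y in range(n_states))

    v = [0.0] * n_states
    for _ in range(max_iter):
        v_new = [min(q(x, a, v) for a in range(n_actions)) for x in range(n_states)]
        converged = max(abs(v_new[x] - v[x]) for x in range(n_states)) < tol
        v = v_new
        if converged:
            break

    # A deterministic stationary policy: one fixed action per state.
    policy = [min(range(n_actions), key=lambda a: q(x, a, v)) for x in range(n_states)]
    return v, policy


# Toy data (assumed): action 0 stays in place, action 1 switches state.
P = [
    [[1.0, 0.0], [0.0, 1.0]],  # action 0: stay
    [[0.0, 1.0], [1.0, 0.0]],  # action 1: switch
]
c = [
    [0.0, 1.0],  # state 0: staying is free, switching costs 1
    [2.0, 1.0],  # state 1: staying costs 2, switching costs 1
]
v, policy = value_iteration(P, c)
```

In this toy case the policy selects a single action in each state regardless of time or history, which is the finite-state analogue of the stationary policies whose existence the abstract addresses.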

Cited by 103 publications (183 citation statements)
References 38 publications
“…Thus for an MDP an initial state x is considered instead of the initial distribution p. In fact, this MDP possesses a special property that action sets at all the states are equal. For MDPs, Feinberg et al [14] provides general conditions for the existence of optimal policies, validity of optimality equations, and convergence of value iterations. Here we formulate these conditions for an MDP whose action sets in all states are equal.…”
Section: R a F T (mentioning)
confidence: 99%
“…According to Feinberg et al [14, Corollary 3.2], the real-valued function $\psi(x) = \inf_{a \in A} c(x, a)$, $x \in X_{<+\infty}$, with values in $\mathbb{R}$, is inf-compact on $X_{<+\infty}$. Furthermore, (6.2) implies that $\int_{X_{<+\infty}} \psi(x)\, z^{(n)}(dx) \le \lambda$, $n = 1, 2,$ …”
Section: The Inequalities (mentioning)
confidence: 99%
“…It is well known that the set of deterministic stationary policies contains optimal policies for a large class of infinite horizon discounted cost problems (see, e.g., [6], [14]) and average cost optimal control problems (see, e.g., [1], [14]). …”
Section: B(a) (mentioning)
confidence: 99%
“…It is well known that the set of deterministic stationary policies contains an optimal policy for a large class of infinite horizon discounted cost problems (see, e.g., [4], [7]) and average cost optimal control problems (see, e.g., [4]). …”
Section: Markov Decision Processes (mentioning)
confidence: 99%