Wiley Encyclopedia of Operations Research and Management Science 2011
DOI: 10.1002/9780470400531.eorms0906

Total Expected Discounted Reward MDPs: Existence of Optimal Policies

Abstract: This article describes results on the existence of optimal and nearly optimal policies for Markov decision processes (MDPs) with total expected discounted rewards. Optimizing total expected discounted rewards for MDPs is also known as discounted dynamic programming.
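
To make the criterion in the abstract concrete, here is a minimal sketch of value iteration for the total expected discounted reward on a small finite MDP. It is not taken from the article: the two-state model, its transition probabilities and rewards, the discount factor `BETA`, and the helper names `q_value` and `value_iteration` are all assumptions chosen for illustration. For a finite model with a discount factor strictly below 1, the iteration converges and a stationary deterministic policy that is greedy with respect to the limit is optimal.

```python
# Hypothetical two-state, two-action MDP; every number here is illustrative.
# P[s][a] lists (next_state, probability) pairs; R[s][a] is the one-step reward.
P = {
    0: {0: [(0, 0.9), (1, 0.1)], 1: [(0, 0.2), (1, 0.8)]},
    1: {0: [(0, 0.5), (1, 0.5)], 1: [(1, 1.0)]},
}
R = {0: {0: 1.0, 1: 0.0}, 1: {0: 2.0, 1: 3.0}}
BETA = 0.9  # discount factor, assumed strictly between 0 and 1


def q_value(s, a, v):
    """One-step reward plus discounted expected value of the next state."""
    return R[s][a] + BETA * sum(p * v[t] for t, p in P[s][a])


def value_iteration(tol=1e-10, max_iter=10_000):
    """Iterate v <- max_a q(s, a, v) until successive iterates differ by < tol."""
    v = {s: 0.0 for s in P}
    for _ in range(max_iter):
        new_v = {s: max(q_value(s, a, v) for a in P[s]) for s in P}
        converged = max(abs(new_v[s] - v[s]) for s in P) < tol
        v = new_v
        if converged:
            break
    # A stationary deterministic policy that is greedy with respect to v.
    policy = {s: max(P[s], key=lambda a: q_value(s, a, v)) for s in P}
    return v, policy


if __name__ == "__main__":
    values, policy = value_iteration()
    print(values, policy)
```

In this toy model the greedy policy picks action 1 in both states. The article's existence results concern more general state and action spaces, which this finite sketch does not attempt to cover.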

Cited by 5 publications (1 citation statement)
References 17 publications
“…Such arguments are standard in the literature on MDP and infinite-horizon inventory control problems (cf. Iglehart (1963), Sennott (1989), Schäl (1993), Fleischmann and Kuik (2003), Feinberg (2011), Huh, Janakiraman and Nagarajan (2011)). We note that the somewhat non-standard aspect here is that the demand in each period is distributed as D − r L , and thus may be negative.…”
Section: Proof of Theorem (mentioning)
confidence: 99%