2007
DOI: 10.1287/moor.1060.0224
|View full text |Cite
|
Sign up to set email alerts
|

Solution and Forecast Horizons for Infinite-Horizon Nonhomogeneous Markov Decision Processes

Abstract: We consider the problem of solving a nonhomogeneous infinite horizon Markov Decision Process (MDP) problem in the general case of potentially multiple optimal first period policies. More precisely, we seek an algorithm that, given a finite subset of the problem's potentially infinite data set, delivers an optimal first period policy. Such an algorithm can thus recursively generate, within a rolling horizon procedure, an infinite horizon optimal solution to the original infinite horizon problem. However it can … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
16
0

Year Published

2011
2011
2024
2024

Publication Types

Select...
4
2

Relationship

1
5

Authors

Journals

citations
Cited by 19 publications
(16 citation statements)
references
References 23 publications
(20 reference statements)
0
16
0
Order By: Relevance
“…This theorem only establishes the existence of iterations K n with the stated property -we cannot tell whether we have reached K n since it is not possible in general to finitely establish optimality of early decisions in nonstationary MDPs [13].…”
Section: Setmentioning
confidence: 98%
See 4 more Smart Citations
“…This theorem only establishes the existence of iterations K n with the stated property -we cannot tell whether we have reached K n since it is not possible in general to finitely establish optimality of early decisions in nonstationary MDPs [13].…”
Section: Setmentioning
confidence: 98%
“…For each (n, s, a), let J n (s, a) be the set of nodes in stage n + 1 that are reachable on choosing action a in node s in stage n. That is, J n (s, a) = {s ∈ S : p n (s |s, a) > 0}. (13) Then, the hyperarc corresponding to action a ∈ A that emanates from the node representing state s ∈ S in stage n ∈ N has |J n (s, a)| "heads". Furthermore, the flow reaching from node s to node s ∈ J n (s, a) equals p n (s |s, a)x n (s, a).…”
Section: A Cilp Formulation Of Nonstationary Mdpsmentioning
confidence: 99%
See 3 more Smart Citations