2000
DOI: 10.1006/jmaa.2000.6819

Multiple Objective Nonatomic Markov Decision Processes with Total Reward Criteria

Abstract: We consider a Markov decision process with an uncountable state space and multiple rewards. For each policy, its performance is evaluated by a vector of total expected rewards. Under the standard continuity assumptions and the additional assumption that all initial and transition probabilities are nonatomic, we prove that the set of performance vectors for all policies is equal to the set of performance vectors for nonrandomized Markov policies. This result implies the existence of optimal nonrandomize…
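The abstract's central object is the performance vector: one total expected reward per criterion, for a fixed policy. The paper works with an uncountable (nonatomic) state space; the following is only a hypothetical finite-state, finite-horizon sketch of how such a vector is evaluated for a nonrandomized Markov policy. All names (`P`, `r`, `policy`, `performance_vector`) and the toy dimensions are illustrative assumptions, not the paper's construction.

```python
import numpy as np

# Toy dimensions (assumptions for illustration only).
n_states, n_actions, horizon = 3, 2, 4

rng = np.random.default_rng(0)
# P[a, s, :] is the transition distribution from state s under action a.
P = rng.dirichlet(np.ones(n_states), size=(n_actions, n_states))
# r[k, s, a]: reward under criterion k (two criteria, as in a multi-reward MDP).
r = rng.uniform(size=(2, n_states, n_actions))
# A nonrandomized Markov policy: one action per (time, state) pair.
policy = rng.integers(n_actions, size=(horizon, n_states))

def performance_vector(policy, initial_dist):
    """Vector of total expected rewards, computed by forward induction
    on the state distribution mu."""
    mu = initial_dist.copy()
    total = np.zeros(2)
    for t in range(horizon):
        a = policy[t]  # action chosen in each state at time t
        for k in range(2):
            # expected reward under criterion k at time t
            total[k] += mu @ r[k][np.arange(n_states), a]
        # propagate the state distribution one step:
        # mu'(s') = sum_s mu(s) * P[a(s), s, s']
        mu_next = np.zeros(n_states)
        for s in range(n_states):
            mu_next += mu[s] * P[a[s], s]
        mu = mu_next
    return total

v = performance_vector(policy, np.full(n_states, 1.0 / n_states))
```

The paper's result says that, under nonatomicity plus continuity assumptions, ranging over nonrandomized Markov policies already sweeps out every performance vector achievable by arbitrary (randomized, history-dependent) policies.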

Cited by 10 publications (11 citation statements)
References 22 publications
“…It seems that establishing this characterization result could be quite involving, especially for general CTMDP models in Borel spaces. Instead, like in [12,13] and [36] for discrete-time and continuous-time problems with total undiscounted and discounted criteria and [31] focusing on the performance analysis of queueing networks, we pass the average constrained CTMDP problem from the infinite dimensional framework (in the space of measures) to the finite dimensional framework by investigating the space of performance vectors.…”
Section: Introduction
confidence: 99%
“…This result was established in [7] as a corollary of the following fact: the performance set for nonrandomized Markov policies coincides with the performance set for all policies in a nonatomic MDP satisfying continuity and compactness conditions. Continuity and compactness conditions were essential for the proofs in Feinberg and Piunovskiy [7].…”
[Footnote in the citing paper: Research of this coauthor was partially supported by NSF Grant DMI-9908258.]
Section: Introduction
confidence: 81%
“…Lemma 9 in Feinberg and Piunovskiy [7] implies that there is a policy π′ in the new MDP such that R̄ⁿ(π′) = Rⁿ(π) and R^{N+1}(π′) = Rⁿ(π, T). Consider a sequence ε_k ↓ 0. We define T₁ > 0 such that for all n = 1, .…”
Section: Lemma 3
confidence: 99%