2006
DOI: 10.1561/0900000002
Monotonicity in Markov Reward and Decision Chains: Theory and Applications

Cited by 73 publications (51 citation statements)
References 54 publications (94 reference statements)
“…A standard approach for studying the optimal policies of MDPs is to explore the first- and/or second-order properties of the optimal cost function (see Koole 2006). Optimal cost functions for multivariate MDPs (like ours) are typically shown to be convex in each dimension of the state space.…”
Section: Introduction
confidence: 99%
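The "first- and second-order properties" mentioned above can be checked numerically. The following is a minimal sketch, with hypothetical parameters, of relative value iteration for a one-dimensional admission-control MDP (uniformized single queue, linear holding cost, admission reward r); the first differences of the converged relative value function are then verified to be nonnegative (monotonicity) and nondecreasing (convexity), away from the truncation boundary.

```python
# Toy admission-control MDP (all parameters are illustrative assumptions).
N = 50               # truncation level: state space {0, ..., N}
lam, mu = 0.4, 0.6   # uniformized arrival/service rates (lam + mu = 1)
r = 5.0              # reward earned by admitting a job (a cost of -r)

V = [0.0] * (N + 1)
for _ in range(2000):
    W = [0.0] * (N + 1)
    for x in range(N + 1):
        up = V[min(x + 1, N)]
        # on an arrival: admit (collect r, move up) or reject (stay put)
        arrival = min(V[x], up - r)
        departure = V[max(x - 1, 0)]
        W[x] = x + lam * arrival + mu * departure
    # relative value iteration: renormalize to keep the values bounded
    W = [w - W[0] for w in W]
    V = W

# first-order: differences nonnegative; second-order: differences
# nondecreasing (convexity); skip the last states to avoid boundary effects
d = [V[x + 1] - V[x] for x in range(N)]
assert all(d[x] >= -1e-9 for x in range(N - 1))
assert all(d[x + 1] >= d[x] - 1e-9 for x in range(N - 2))
print("V is nondecreasing and convex on the truncated state space")
```

Convexity of V immediately yields a threshold-type optimal policy here: admit exactly while V(x+1) - V(x) < r, which is the kind of structural conclusion the quoted papers draw.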
“…For both models, we show via dynamic programming that (i) the optimal allocation policy has a work-conservation property, which implies that when the system is not empty the optimal policy may not keep all computing resources idle, (ii) the optimal number of servers follows a step function whose extreme case is the bang-bang control policy, under which a facility receives either all computing resources or none at all, and moreover (iii) we provide the conditions under which the bang-bang control policy is optimal. The techniques to prove such results are based on monotonicity properties of the dynamic programming relative value function (see, e.g., Koole 1998, 2006, Rykov 2001).…”
Section: Introduction
confidence: 99%
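The bang-bang structure described above can be illustrated with a small sketch (hypothetical model and parameters, not the cited paper's): a single queue in which the controller picks the number of active servers s ∈ {0, ..., S} at a per-server cost c. Because the one-step Bellman expression is linear in s whenever x ≥ S, its minimizer sits at an extreme point, so in those states the optimal action is all servers or none.

```python
# Server-allocation MDP; all parameters are illustrative assumptions.
N, S = 40, 4
lam, mu, c = 0.3, 0.15, 0.5   # arrival rate, per-server rate, server cost
Lam = lam + S * mu            # uniformization constant

def one_step(V, x, s):
    # expected cost-to-go of using s servers in state x
    srv = min(s, x) * mu      # cannot serve more jobs than are present
    q = (lam * V[min(x + 1, N)]
         + srv * V[x - 1 if x else 0]
         + (Lam - lam - srv) * V[x]) / Lam
    return x + c * s + q

V = [0.0] * (N + 1)
for _ in range(3000):
    W = [min(one_step(V, x, s) for s in range(S + 1)) for x in range(N + 1)]
    W = [w - W[0] for w in W]   # relative value iteration
    V = W

# extract the optimal action in each state
policy = [min(range(S + 1), key=lambda s: one_step(V, x, s))
          for x in range(N + 1)]
print(policy[:15])
```

For x ≥ S the cost is exactly linear in s (slope c + (mu/Lam)(V(x-1) - V(x))), so the extracted policy takes only the values 0 and S there: a step function with the bang-bang control as its extreme case, matching claims (ii) and (iii) in the quotation.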
“…To analyze the value function V_n(x, y) in §4.4.1.3, we employ the event-based dynamic programming approach introduced by Koole (1998, 2006). To this end, let V denote the set of all functions v : S → R and let f, f_1, ..., f_{m+2} ∈ V. We define the following operators…”
Section: MDP Formulation With Bounded Transition Rates
confidence: 99%
“…These operators are variations of operators defined by Koole (1998, 2004, 2006) and were originally intended to model various common queueing mechanisms such as arrival control (T_{AC(i)}), transfer departures from multi-server tandem queues (T_{TD(i)}), and departures from multi-server queues (T_{D(i)}), while the operators T_cost f(x, y), T_env and T_unif are mainly convenient for bookkeeping. The Bellman recursion for our MDP, (4.4), can now be written succinctly as…”
Section: MDP Formulation With Bounded Transition Rates
confidence: 99%
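The event-based approach of the two quotations above composes the Bellman recursion out of small per-event operators, each of which preserves structural properties such as monotonicity and convexity. The following is a minimal one-dimensional sketch in that spirit; the operator names echo the text, but the state space, rates, and rejection penalty p are illustrative assumptions.

```python
# Event-based dynamic programming sketch (illustrative parameters).
N = 30
lam, mu = 0.5, 0.5   # uniformized rates, lam + mu = 1
p = 3.0              # penalty for rejecting an arrival (assumed)

def T_AC(f):
    # arrival control: admit (move to x+1) or pay p and reject
    return [min(f[min(x + 1, N)], p + f[x]) for x in range(N + 1)]

def T_D(f):
    # departure from a single-server queue
    return [f[max(x - 1, 0)] for x in range(N + 1)]

def T_cost(f):
    # add the one-stage holding cost
    return [x + f[x] for x in range(N + 1)]

def T_unif(fs, rates):
    # uniformization: mix the event operators with their rates
    return [sum(r * f[x] for r, f in zip(rates, fs)) for x in range(N + 1)]

def bellman(V):
    # one step of the recursion: V_{n+1} = T_cost T_unif(T_AC V, T_D V)
    return T_cost(T_unif([T_AC(V), T_D(V)], [lam, mu]))

V = [0.0] * (N + 1)
for _ in range(500):
    W = bellman(V)
    V = [w - W[0] for w in W]   # relative value iteration
```

Because each operator individually maps monotone (or convex) functions to monotone (or convex) functions, the fixed point inherits these properties, which is how monotonicity of the value function, and hence of the optimal policy, is established compositionally in this framework.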