The Simplex and Policy-Iteration Methods Are Strongly Polynomial for the Markov Decision Problem with a Fixed Discount Rate

Ye, Yinyu

doi:10.1287/moor.1110.0516

Cited by 144 publications

(176 citation statements)

References 25 publications

(30 reference statements)

Supporting

Mentioning

170

Contrasting

Order By: Relevance

“…On another front, the interesting polynomial simplex-like algorithm of Kelner and Spielman [20] does not settle Conjecture 2 because it is weakly polynomial, as the complexity of each iteration, and the number of iterations, depends (polynomially of course) on the bits of the integers in the input. Other recent results related to Conjecture 1 can be found in [6,35].…”

Section: Conjecture 1 There Is a Strongly Polynomial Algorithm For Lmentioning

confidence: 83%

On Simplex Pivoting Rules and Complexity Theory

Adler

Papadimitriou

Rubinstein

2014

Integer Programming and Combinatorial Optimization

View full text Add to dashboard Cite

Abstract. We show that there are simplex pivoting rules for which it is PSPACE-complete to tell if a particular basis will appear on the algorithm's path. Such rules cannot be the basis of a strongly polynomial algorithm, unless P = PSPACE. We conjecture that the same can be shown for most known variants of the simplex method. However, we also point out that Dantzig's shadow vertex algorithm has a polynomial path problem. Finally, we discuss in the same context randomized pivoting rules.

show abstract

Section: Conjecture 1 There Is a Strongly Polynomial Algorithm For Lmentioning

confidence: 83%

On Simplex Pivoting Rules and Complexity Theory

Adler

Papadimitriou

Rubinstein

2014

Integer Programming and Combinatorial Optimization

View full text Add to dashboard Cite

show abstract

“…By Lemma 4.12, we know thatδ(s, a, N (u k )) → 0 as k → ∞. We will also show that γ u k ,N (u k ) (s, a) becomes nonpositive as k → ∞, which will contradict (25), and thus, we will conclude thatȳ is feasible to (P).…”

Section: Proofmentioning

confidence: 67%

“…Recently, complexity of the simplex method with the Dantzig's pivoting rule for finite-state MDPs was studied in [25]. If one can derive a number of iterations (or computational complexity) for the simplex algorithm for countable-state MDPs to find a policy whose value function is within a given threshold from the optimal value function, then it would be possible to compare the convergence rates of the algorithms for countable-state MDPs by comparing the result for the simplex algorithm to the ones in [23,21].…”

Section: Discussion and Future Researchmentioning

confidence: 99%

“…It is well known that policy iteration, one of the popular solution methods for MDPs with finite state space, can be viewed as the simplex method applied to an equivalent LP formulation of the MDP. A recent result in [25] showed that for finite-state MDPs, simplex method with Dantzig's pivoting rule (for maximization, choosing a non-basic variable with the most positive reduced cost) is strongly polynomial for a fixed discount factor, and the complexity bound is better than that of the other solution methods.…”

Section: Motivation and Contributionmentioning

confidence: 99%

See 1 more Smart Citation

Simplex Algorithm for Countable-State Discounted Markov Decision Processes

Lee

Epelman

Romeijn

et al. 2017

Operations Research

View full text Add to dashboard Cite

We consider discounted Markov Decision Processes (MDPs) with countably-infinite state spaces, finite action spaces, and unbounded rewards. Typical examples of such MDPs are inventory management and queueing control problems in which there is no specific limit on the size of inventory or queue. Existing solution methods obtain a sequence of policies that converges to optimality in value but may not improve monotonically, i.e., a policy in the sequence may be worse than preceding policies. Our proposed approach considers countably-infinite linear programming (CILP) formulations of the MDPs (a CILP is defined as a linear program (LP) with countably-infinite numbers of variables and constraints). Under standard assumptions for analyzing MDPs with countably-infinite state spaces and unbounded rewards, we extend the major theoretical extreme point and duality results to the resulting CILPs. Under an additional technical assumption which is satisfied by several applications of interest, we present a simplextype algorithm that is implementable in the sense that each of its iterations requires only a finite amount of data and computation. We show that the algorithm finds a sequence of policies which improves monotonically and converges to optimality in value. Unlike existing simplex-type algorithms for CILPs, our proposed algorithm solves a class of CILPs in which each constraint may contain an infinite number of variables and each variable may appear in an infinite number of constraints. A numerical illustration for inventory management problems is also presented.

show abstract

“…It should be remarked that the diameter of the resulting polytopes is actually smaller than the Hirsch bound. Curiously, Ye (2011) showed that the simplex method using Dantzig's pivot rule (where one chooses the entering variable with the largest reduced cost coefficient). is strongly polynomial for the linear programs derived from Markov Decision Processes with Fixed Discount (which is not the setting for the other papers, but is an important case of MDPs).…”

Section: Comment 2: We Still Need To Work Harder To Understand the Gementioning

confidence: 99%

Comments on: Recent progress on the combinatorial diameter of polytopes and simplicial complexes

Loera

2013

TOP

View full text Add to dashboard Cite

The Simplex and Policy-Iteration Methods Are Strongly Polynomial for the Markov Decision Problem with a Fixed Discount Rate

Cited by 144 publications

References 25 publications

On Simplex Pivoting Rules and Complexity Theory

On Simplex Pivoting Rules and Complexity Theory

Simplex Algorithm for Countable-State Discounted Markov Decision Processes

Comments on: Recent progress on the combinatorial diameter of polytopes and simplicial complexes

Contact Info

Product

Resources

About