2020
DOI: 10.1609/aaai.v34i02.5509
MIPaaL: Mixed Integer Program as a Layer

Abstract: Machine learning components commonly appear in larger decision-making pipelines; however, the model training process typically focuses only on a loss that measures average accuracy between predicted values and ground truth values. Decision-focused learning explicitly integrates the downstream decision problem when training the predictive model, in order to optimize the quality of decisions induced by the predictions. It has been successfully applied to several limited combinatorial problem classes, such as tho…

Cited by 66 publications (58 citation statements); references 18 publications.
“…In case of problems with weaker relaxations, one could consider adding cutting planes prior to solving (Ferber et al. 2019). Moreover, further improvements could be achieved by exploiting the fact that all previously computed solutions are valid candidates.…”
Section: Discussion
confidence: 99%
“…In case of weak MIP relaxations, one can also use a cutting plane algorithm in the root node and use the resulting tighter relaxation thereof (Ferber et al. 2019). Other weaker oracles could also be used, for example setting a time limit on an any-time solver and using the best solution found, or a node limit on search algorithms.…”
Section: Combinatorial Problems and Scaling Up
confidence: 99%
“…Following this idea, existing work has developed implicit argmin layers in neural networks, including OptNet [13] for quadratic programming (QP) problems and CVXPY [14] for more general convex optimization problems. Further, with linear relaxation and QP regularization, Wilder et al. derived an end-to-end framework for combinatorial programs [9], which accelerates the computation by leveraging the low-rank property of decision vectors [3] and is further extended to mixed integer linear programs in MIPaaL [27]. Besides, for the relaxed LP problems, instead of differentiating the KKT conditions, IntOpt [28] proposes an interior-point-based approach which computes gradients by differentiating the homogeneous self-dual formulation.…”
Section: Related Work
confidence: 99%
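The implicit-layer idea referenced in the statement above (OptNet-style differentiation through an argmin) can be illustrated on the simplest case: an equality-constrained QP, where the KKT system is linear and its solution can be differentiated in closed form. The sketch below is purely illustrative — it is not MIPaaL's or OptNet's implementation, and all variable names are made up; it only shows that the Jacobian of the QP solution with respect to the cost vector, obtained from the KKT matrix, matches finite differences.

```python
import numpy as np

# Minimize (1/2) x^T Q x + q^T x  subject to  A x = b.
# KKT conditions: [Q A^T; A 0] [x; nu] = [-q; b], a single linear system.
n, m = 3, 1
rng = np.random.default_rng(0)
M = rng.normal(size=(n, n))
Q = M @ M.T + n * np.eye(n)          # positive definite objective
A = rng.normal(size=(m, n))
b = rng.normal(size=m)
q = rng.normal(size=n)

K = np.block([[Q, A.T], [A, np.zeros((m, m))]])  # KKT matrix

def solve_qp(q_vec):
    """Solve the KKT system and return the primal solution x*(q)."""
    sol = np.linalg.solve(K, np.concatenate([-q_vec, b]))
    return sol[:n]

x = solve_qp(q)

# Implicit differentiation: K [dx; dnu] = [-dq; 0], so
# dx*/dq is the top-left n-by-n block of -K^{-1}.
J = -np.linalg.inv(K)[:n, :n]

# Finite-difference check of the analytic Jacobian.
eps = 1e-6
J_fd = np.zeros((n, n))
for i in range(n):
    e = np.zeros(n)
    e[i] = eps
    J_fd[:, i] = (solve_qp(q + e) - solve_qp(q - e)) / (2 * eps)

assert np.allclose(J, J_fd, atol=1e-5)
```

In a learning pipeline, `J` is exactly the factor the backward pass multiplies upstream gradients by; inequality constraints add an active-set/complementarity structure that OptNet handles by differentiating the full KKT conditions at the solution.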
“…A series of works about differentiating through CO problems (…; Elmachtoub and Grigas, 2020; Ferber et al., 2020) relax ILPs by adding L1, L2, or log-barrier regularization terms and differentiate through the KKT conditions deriving from the application of the cutting-plane or interior-point methods. These approaches are conceptually linked to techniques for differentiating through smooth programs (…; Donti et al., 2017; Agrawal et al., 2019; Chen et al., 2020; Domke, 2012; Franceschi et al., 2018) that arise not only in modelling but also in hyperparameter optimization and meta-learning.…”
Section: Related Work
confidence: 99%
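The regularization idea in the statement above — making a discrete linear program differentiable by adding a smooth penalty — has one well-known closed-form instance: an entropy-regularized LP over the probability simplex, whose solution is the softmax and whose Jacobian is available analytically. This is a generic sketch of that special case (not the specific method of any paper cited above), with the temperature `tau` chosen arbitrarily for illustration.

```python
import numpy as np

def smoothed_argmax(c, tau=0.5):
    """Entropy-regularized LP over the simplex:
    argmax_{x in simplex} c^T x + tau * H(x) = softmax(c / tau)."""
    z = c / tau
    z = z - z.max()              # shift for numerical stability
    p = np.exp(z)
    return p / p.sum()

c = np.array([1.0, 0.3, -0.7])
tau = 0.5
p = smoothed_argmax(c, tau)

# Analytic Jacobian of the regularized solution w.r.t. the cost vector:
# d softmax(c/tau) / dc = (diag(p) - p p^T) / tau.
J = (np.diag(p) - np.outer(p, p)) / tau

# Finite-difference check.
eps = 1e-6
J_fd = np.zeros((3, 3))
for i in range(3):
    e = np.zeros(3)
    e[i] = eps
    J_fd[:, i] = (smoothed_argmax(c + e, tau) - smoothed_argmax(c - e, tau)) / (2 * eps)

assert np.allclose(J, J_fd, atol=1e-6)
```

As `tau` goes to zero, `p` approaches the one-hot argmax of the unregularized LP, but the Jacobian degenerates — the usual smoothness-versus-fidelity trade-off these relaxation-based approaches navigate.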