2020
DOI: 10.48550/arxiv.2006.14076
Preprint
The Convex Relaxation Barrier, Revisited: Tightened Single-Neuron Relaxations for Neural Network Verification

Christian Tjandraatmadja, Ross Anderson, Joey Huchette, et al.

Abstract: We improve the effectiveness of propagation- and linear-optimization-based neural network verification algorithms with a new tightened convex relaxation for ReLU neurons. Unlike previous single-neuron relaxations which focus only on the univariate input space of the ReLU, our method considers the multivariate input space of the affine pre-activation function preceding the ReLU. Using results from submodularity and convex geometry, we derive an explicit description of the tightest possible convex relaxation when…
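
For context (background added here, not taken from the abstract): the standard single-neuron relaxation that this work tightens is the well-known "triangle" relaxation of a ReLU output $y = \max(0, \hat{x})$ over pre-activation bounds $l \le \hat{x} \le u$ with $l < 0 < u$:

\[
y \ge 0, \qquad y \ge \hat{x}, \qquad y \le \frac{u(\hat{x} - l)}{u - l}.
\]

The relaxation proposed in the paper is instead taken over the multivariate input $x$ of the pre-activation $\hat{x} = w^\top x + b$, which can yield tighter bounds than any relaxation expressed in $\hat{x}$ alone.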

Cited by 8 publications (13 citation statements)
References 23 publications
“…MIO has been gaining traction as a tool to solve challenging learning problems. It has been used for example to tackle sparse regression problems (Wilson and Sahinidis 2017, Atamtürk and Gómez 2019, Bertsimas et al 2020b, Hazimeh et al 2020, Xie and Deng 2020, Gómez and Prokopyev 2021), verification of neural networks (Fischetti and Jo 2018, Khalil et al 2018, Tjandraatmadja et al 2020), and sparse principal component analysis (Dey et al 2018, Bertsimas et al 2020a), among others. More importantly in the context of this paper, MIO methods have been proposed to learn optimal decision trees (Bertsimas and Dunn 2017, Verwer and Zhang 2019, Aghaei et al 2019, 2020, Elmachtoub et al 2020, Mišić 2020).…”
Section: Related Work
confidence: 99%
“…These bounds tend to be loose unless optimized during training, which typically comes at a significant cost to standard performance. Further work has aimed to tighten these bounds [7,31,33]; however, these works focus primarily on small convolutional networks and struggle to scale to more typical deep networks. Other work has studied the limits of these convex relaxations on these small networks using vast amounts of CPU-compute [28].…”
Section: Related Work
confidence: 99%
“…We defer additional analogous results in the smaller MNIST setting to Appendix E.3. CIFAR10: Much work studying verification of deep networks in the CIFAR10 setting [7,31,33], including the more scalable Lagrangian-based methods [2,10], has focused primarily on a CNN with only 6k hidden units from [36], smaller than the LeNet architecture used for MNIST [21]. Consequently, the LP relaxation for this network can be solved exactly with a commercial LP solver such as Gurobi, and recent SDP solvers can produce even tighter bounds [6], rendering any LP-based solutions for this network obsolete.…”
Section: Improved Bounds from Exact LP Solutions
confidence: 99%
“…They leverage MP to optimize a mixed-integer (linear) problem (MIP) over a polyhedral action space using commercially available solvers such as CPLEX and Gurobi. A number of papers show how trained ReLU-based DNNs can be expressed as an MP, with (Tjandraatmadja et al 2020; Anderson et al 2020) also providing ideal reformulations that improve computational efficiency with a solver. (Ryu et al 2019) propose a Q-learning framework to optimize over continuous action spaces using a combination of MP and a DNN actor.…”
Section: Literature Review
confidence: 99%
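
As a hedged illustration of the kind of MIP formulation referenced in the excerpt above (this is the standard big-M encoding in the spirit of Fischetti and Jo 2018, not the tighter "ideal" formulations of Anderson et al. or Tjandraatmadja et al.): a single ReLU $y = \max(0, w^\top x + b)$ with finite pre-activation bounds $L \le w^\top x + b \le U$, $L < 0 < U$, can be modeled with one binary variable $z$ as

\[
y \ge w^\top x + b, \qquad y \ge 0, \qquad y \le w^\top x + b - L(1 - z), \qquad y \le U z, \qquad z \in \{0, 1\}.
\]

Setting $z = 0$ forces $y = 0$ (the inactive case), while $z = 1$ forces $y = w^\top x + b$ (the active case).
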
“…(Ryu et al 2019) propose a Q-learning framework to optimize over continuous action spaces using a combination of MP and a DNN actor. (Delarue, Anderson, and Tjandraatmadja 2020; van Heeswijk and La Poutré 2019; Xu et al 2020) show how to use ReLU-based DNN value functions to optimize combinatorial problems (e.g., vehicle routing) where the immediate rewards are deterministic and the action space is vast. We extend such approaches and results to problems where the immediate reward can be uncertain, as is the case with inventory management problems.…”
Section: Literature Review
confidence: 99%