2020
DOI: 10.1287/ijoo.2019.0026
|View full text |Cite
|
Sign up to set email alerts
|

Mining Optimal Policies: A Pattern Recognition Approach to Model Analysis

Abstract: This project spawned from an admission control problem we were working on for a major hospital in the Boston area. We tried to incorporate various aspects of the problem in a model, which resulted in a complex optimization problem that was difficult to solve analytically. Although numerical solutions could be computed, we were looking for insights to characterize simple policies that could be used in practice. We then came up with the idea of using machine learning to analyze solutions as a mean for obtaining … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 23 publications
(8 citation statements)
references
References 42 publications
0
8
0
Order By: Relevance
“…Extract simple heuristic policies from DRL outputs: Instead of leveraging analytical results to improve the numerical performance of DRL, the numerical policies developed by DRL may also spur the development of new, simpler heuristic policies that can be analytically characterized, and are thus easier to implement. A novel approach to convert numerical results into analytic insights is proposed by Bravo & Shaposhnik (2020) . They leverage exact numerical methods to find the optimal value functions for a range of MDPs, and subsequently use the output as input to a machine learning method to extract analytic insights into the structure of the optimal policy for a range of problem domains: inventory management, queuing admission control, multi-armed bandits, and revenue management.…”
Section: Blending Numerical and Analytical Approaches To Optimize Inventory Policiesmentioning
confidence: 99%
See 1 more Smart Citation
“…Extract simple heuristic policies from DRL outputs: Instead of leveraging analytical results to improve the numerical performance of DRL, the numerical policies developed by DRL may also spur the development of new, simpler heuristic policies that can be analytically characterized, and are thus easier to implement. A novel approach to convert numerical results into analytic insights is proposed by Bravo & Shaposhnik (2020) . They leverage exact numerical methods to find the optimal value functions for a range of MDPs, and subsequently use the output as input to a machine learning method to extract analytic insights into the structure of the optimal policy for a range of problem domains: inventory management, queuing admission control, multi-armed bandits, and revenue management.…”
Section: Blending Numerical and Analytical Approaches To Optimize Inventory Policiesmentioning
confidence: 99%
“…This sharply contrasts with the often highly intuitive character of policies obtained via analytic methods, such as for instance base-stock or constant order policies. In addition to developing more intuitive policies from the complex output of neural networks similar to Bravo & Shaposhnik (2020) , we may also develop models that additionally explain why an action is proposed. A vast field exists on explaining and interpreting the output of AI models (see also Gilpin et al, 2018 ).…”
Section: Blending Numerical and Analytical Approaches To Optimize Inventory Policiesmentioning
confidence: 99%
“…Recently, there has been some works toward developing methods for interpretable policies in sequential decision-making(not necessarily specific to the healthcare setting). Bravo and Shaposhnik (2020) propose to explain the optimal unconstrained policies with decision trees, applying their framework to classical operations problems such as queuing control and multi-armed bandit (MAB). However, this may be misleading, as there is no guarantee that the novel explainable, suboptimal policies have the same performance as the unconstrained, optimal policies (Rudin 2019).…”
Section: Related Literaturementioning
confidence: 99%
“…In recent years there has been an emerging interest in combining ideas from machine learning with operations research to develop a framework that uses data to prescribe optimal decisions (Bertsimas and Kallus, 2019;Den Hertog and Postek, 2016;Bravo and Shaposhnik, 2018). Current research focus has been on applying machine learning methodologies to predict the counterfactuals, based on which optimal decisions can be made.…”
Section: The Problem and Related Workmentioning
confidence: 99%