2009
DOI: 10.1007/978-3-642-04244-7_16
Minimising Decision Tree Size as Combinatorial Optimisation

Abstract: Decision tree induction techniques attempt to find small trees that fit a training set of data. This preference for smaller trees, which provides a learning bias, is often justified as being consistent with the principle of Occam's Razor. Informally, this principle states that one should prefer the simpler hypothesis. In this paper we take this principle to the extreme. Specifically, we formulate decision tree induction as a combinatorial optimisation problem in which the objective is to minimise the number of…

Cited by 52 publications (67 citation statements) | References 4 publications
“…Hence, greedy-based heuristics such as CART (Breiman et al. 1984) and ID3 (Quinlan 1986) have been widely used to construct sub-optimal trees. Recent years have seen an increasing number of works that employ various Mathematical Optimization methods to build better-quality decision trees, e.g., (Bennett and Blue 1996; Bessiere, Hebrard, and O'Sullivan 2009; Bertsimas and Dunn 2017; Silva 2017; Dash, Günlük, and Wei 2018; Blanquero et al. 2018a; 2018b; Firat et al. 2018).…”
Section: Introduction
confidence: 99%
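The greedy strategy behind ID3 (and, with a different impurity measure, CART) picks at each node the single feature that most reduces label entropy, with no lookahead — which is why the resulting trees can be sub-optimal in size. A minimal sketch of that split-selection step (function names and the dict-based row format are my own, not from the cited papers):

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy of a label sequence, in bits."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())

def information_gain(rows, labels, feature):
    """Reduction in label entropy from partitioning the rows on `feature`."""
    n = len(rows)
    partitions = {}
    for row, label in zip(rows, labels):
        partitions.setdefault(row[feature], []).append(label)
    remainder = sum(len(part) / n * entropy(part) for part in partitions.values())
    return entropy(labels) - remainder

def best_split(rows, labels, features):
    """Greedy ID3-style choice: the feature with maximal information gain."""
    return max(features, key=lambda f: information_gain(rows, labels, f))
```

For example, on four rows where feature 'a' equals the label and 'b' is noise, `best_split` picks 'a' — but on data like XOR, where no single feature has positive gain, the greedy choice is essentially arbitrary, which is the weakness the optimization-based approaches address.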
“…An advantage of these Mathematical Optimization based approaches is that they can employ powerful optimization solvers to find decision trees. This power has led to interesting new approaches for learning models and rules, see e.g., (Bessiere, Hebrard, and O'Sullivan 2009; De Raedt, Guns, and Nijssen 2010; Narodytska et al. 2018; Verwer, Zhang, and Ye 2017). In addition, mathematical optimization models allow flexibility in modeling different learning objectives.…”
Section: Introduction
confidence: 99%
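The learning objective the paper itself targets — take Occam's Razor to the extreme and search for the smallest tree consistent with the data — can be illustrated with a brute-force combinatorial search over binary-feature trees of increasing size. This is only a didactic sketch of the objective, not the paper's constraint-programming encoding; all names and the node-budget scheme are my own, and the data is assumed consistent (no two identical rows with different labels):

```python
def consistent_tree(rows, labels, features, nodes):
    """Return a tree using at most `nodes` internal tests that classifies
    every row correctly, or None if none exists. A tree is either a leaf
    label or a tuple (feature, zero_subtree, one_subtree)."""
    if len(set(labels)) <= 1:
        return labels[0] if labels else 0  # pure leaf
    if nodes == 0:
        return None
    for f in features:
        left = [(r, y) for r, y in zip(rows, labels) if r[f] == 0]
        right = [(r, y) for r, y in zip(rows, labels) if r[f] == 1]
        if not left or not right:
            continue  # useless test: all rows go the same way
        rest = [g for g in features if g != f]  # binary feature: no reuse below
        for k in range(nodes):  # split the remaining node budget
            lt = consistent_tree([r for r, _ in left], [y for _, y in left], rest, k)
            if lt is None:
                continue
            rt = consistent_tree([r for r, _ in right], [y for _, y in right],
                                 rest, nodes - 1 - k)
            if rt is not None:
                return (f, lt, rt)
    return None

def smallest_tree(rows, labels, features):
    """Minimise tree size directly: try budgets 0, 1, 2, ... so the first
    consistent tree found is a smallest one."""
    for budget in range(2 ** (len(features) + 1)):
        t = consistent_tree(rows, labels, features, budget)
        if t is not None:
            return t, budget
    return None  # inconsistent data: no tree fits it exactly
```

On XOR over two features this finds a 3-node tree (a root test plus one test per branch), whereas a greedy learner has no informative first split; the exponential cost of this enumeration is exactly why solver-backed formulations are attractive.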
“…Constraint Programming has already been shown to be a promising approach for Data Mining through various tasks, such as itemset mining [40][41][42][43][44], skypattern mining [45] or decision tree construction [46].…”
Section: First Model, Second Model
confidence: 99%
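Frequent itemset mining, the first Data Mining task the excerpt lists for Constraint Programming, asks for every set of items whose support (number of transactions containing it) meets a threshold — the task the cited CP models encode declaratively. A plain enumerative sketch of that underlying task (not any cited paper's CP model; names are my own):

```python
from itertools import combinations

def frequent_itemsets(transactions, min_support):
    """All itemsets contained in at least `min_support` transactions,
    mapped to their support counts."""
    items = sorted({i for t in transactions for i in t})
    result = {}
    for size in range(1, len(items) + 1):
        found = False
        for candidate in combinations(items, size):
            support = sum(1 for t in transactions if set(candidate) <= set(t))
            if support >= min_support:
                result[candidate] = support
                found = True
        if not found:
            break  # anti-monotonicity: no superset can be frequent either
    return result
```

The early exit uses the same anti-monotonicity of support that Apriori-style miners exploit and that CP models express as constraints.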
“…The objective function (1) maximizes the number of rows correctly predicted, that is, the accuracy of the decision tree. Constraint (2) imposes that exactly one path has to be selected for each leaf. Constraint (3) ensures that each row is directed to exactly one leaf.…”
Section: Decision Variables
confidence: 99%
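The three numbered components the excerpt describes have the generic shape below. This is a reconstruction under assumed notation, not the citing paper's actual formulation: $R$ (rows), $L$ (leaves), $P_l$ (candidate paths for leaf $l$), path variables $x_{lp}$, routing variables $z_{rl}$, and coefficients $c_{rl}$ (1 iff leaf $l$ labels row $r$ correctly) are all my own symbols, and the linking constraints tying $z$ to the selected paths $x$ are omitted:

```latex
\begin{align}
\max\ & \sum_{r \in R} \sum_{l \in L} c_{rl}\, z_{rl}
  && \text{(1) rows correctly predicted} \\
\text{s.t.}\ & \sum_{p \in P_l} x_{lp} = 1 \quad \forall l \in L
  && \text{(2) exactly one path per leaf} \\
& \sum_{l \in L} z_{rl} = 1 \quad \forall r \in R
  && \text{(3) each row reaches exactly one leaf}
\end{align}
```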