2014
DOI: 10.1109/tit.2014.2298874
|View full text |Cite
|
Sign up to set email alerts
|

Risk Bounds for Embedded Variable Selection in Classification Trees

Abstract: The problems of model and variable selections for classification trees are jointly considered. A penalized criterion is proposed which explicitly takes into account the number of variables, and a risk bound inequality is provided for the tree classifier minimizing this criterion. This penalized criterion is compared to the one used during the pruning step of the CART algorithm. It is shown that the two criteria are similar under some specific margin assumptions. In practice, the tuning parameter of the CART pe… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
5
0

Year Published

2015
2015
2022
2022

Publication Types

Select...
4
1

Relationship

3
2

Authors

Journals

citations
Cited by 5 publications
(5 citation statements)
references
References 34 publications
0
5
0
Order By: Relevance
“…For example, some theoretical developments on dyadic partitions of R 2 are given in [4,1], and the VC dimension of axis-parallel cuts appears more particularly in the results obtained on the performance of classification and regression binary decision trees (CART) introduced by Breiman et. al [2] in 1984, and theoretically studied in [8,7,5,6]. In particular, it is to be found in the results of [6] that the VC dimension of axis-parallel cuts is of order log 2 d.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…For example, some theoretical developments on dyadic partitions of R 2 are given in [4,1], and the VC dimension of axis-parallel cuts appears more particularly in the results obtained on the performance of classification and regression binary decision trees (CART) introduced by Breiman et. al [2] in 1984, and theoretically studied in [8,7,5,6]. In particular, it is to be found in the results of [6] that the VC dimension of axis-parallel cuts is of order log 2 d.…”
Section: Introductionmentioning
confidence: 99%
“…al [2] in 1984, and theoretically studied in [8,7,5,6]. In particular, it is to be found in the results of [6] that the VC dimension of axis-parallel cuts is of order log 2 d.…”
Section: Introductionmentioning
confidence: 99%
“…As for the regression case, the properties of the growing algorithm need to be analyzed to obtain full unconditional upper bounds. Results on the performance of theoretical procedures in which CART is viewed as a forward algorithm to approximate an ideal, but intractable, binary tree are given in [13]. Although they do not validate any concrete algorithm as done here, these results confirm that the penalty term used in penalized criterion ( 9) is well chosen under MA(1).…”
Section: Discussionmentioning
confidence: 72%
“…Nevertheless, even if ENT is directly relied to the tree's complexity, taking only ENT into account means forgetting the important compromise to be made between misclassification rate and complexity. It is shown in Gey et al that this compromise is necessary to obtain decision trees of near optimal performance. The authors claim that the tree selected at the end of SF‐GOTA reaches this compromise, but with s ranging from 1 to at most 6, the trees provided by SF‐GOTA are not deep by construction.…”
Section: About Model Selection and Decision Treesmentioning
confidence: 99%