2019
DOI: 10.1145/3337064
|View full text |Cite
|
Sign up to set email alerts
|

Decision Tree Classification with Differential Privacy

Abstract: Data mining information about people is becoming increasingly important in the data-driven society of the 21st century. Unfortunately, sometimes there are real-world considerations that conflict with the goals of data mining; sometimes the privacy of the people being data mined needs to be considered. This necessitates that the output of data mining algorithms be modified to preserve privacy while simultaneously not ruining the predictive power of the outputted model. Differential privacy is a strong, enforcea… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
37
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 88 publications
(42 citation statements)
references
References 60 publications
0
37
0
Order By: Relevance
“…The decision tree is a supervised machine learning method. Its basic idea is to classify samples layer by layer by selecting feature attributes and realize an agent based on feature judgment for data classification, feature selection, and other scenarios [ 38 ]. As shown in Figure 2 , the decision tree algorithm will divide the samples layer by layer according to their attribute values and obtain the judgment results under different attribute combinations, thus forming a tree structure.…”
Section: Methodsmentioning
confidence: 99%
“…The decision tree is a supervised machine learning method. Its basic idea is to classify samples layer by layer by selecting feature attributes and realize an agent based on feature judgment for data classification, feature selection, and other scenarios [ 38 ]. As shown in Figure 2 , the decision tree algorithm will divide the samples layer by layer according to their attribute values and obtain the judgment results under different attribute combinations, thus forming a tree structure.…”
Section: Methodsmentioning
confidence: 99%
“…A Classification and Regression Tree (CART) is a popular binary decision tree that can be used for classification or regression analysis [10], [33], [41]. This paper considers to reduce the complexity of the model while increasing the diversity of the model to maintain a certain degree of accuracy of the base classifier.…”
Section: B Classification and Regression Tree 1) Improved Cartmentioning
confidence: 99%
“…Additionally, some tree‐based differentially private classification algorithms have been proposed in the literature (Blum et al, 2005; Fletcher & Islam, 2017, 2019; Jagannathan et al, 2009; Jagannathan, Monteleoni, & Pillaipakkamnatt, 2013; Patil & Singh, 2014; Rana, Gupta, & Venkatesh, 2015). In 2005, a differentially private version of ID3 (Quinlan, 1992), in which the information gain is estimated with the help of output perturbation by adding noise drawn from Laplace distribution to the results of the count queries, has been proposed (Blum et al, 2005).…”
Section: Differentially Private Classificationmentioning
confidence: 99%
“…To provide data security, differential privacy adds random noise drawn from a distribution such as Laplace , to the functions running on sensitive data. There exist three ways to provide differential privacy guarantee: (a) input perturbation (Ji, Lipton, & Elkan, 2014; Mivule, Turner, & Ji, 2012; Sánchez, Domingo‐Ferrer, Martínez, & Soria‐Comas, 2016; Sarwate & Chaudhuri, 2013; Xu, Yang, & Bai, 2019), (b) objective perturbation (Chaudhuri & Monteleoni, 2008; Chaudhuri, Monteleoni, & Sarwate, 2011; Fukuchi, Tran, & Sakuma, 2017; Ji et al, 2014; Rubinstein, Bartlett, Huang, & Taft, 2009; Zhang, Zhang, Xiao, Yang, & Winslett, 2012), and (c) output perturbation (Bojarski, Choromanska, Choromanski, & LeCun, 2014; Fletcher & Islam, 2015, 2019; Friedman & Schuster, 2010; Gursoy, Inan, Nergiz, & Saygin, 2017; Xu et al, 2019). All the three methods add some random noise during the data analysis process to protect individual's privacy.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation