2021
DOI: 10.48550/arxiv.2110.12770
Preprint
DP-XGBoost: Private Machine Learning at Scale

Abstract: The big-data revolution announced ten years ago [MCB+11] does not seem to have fully happened at the expected scale [Ana16]. One of the main obstacles has been the lack of data circulation, and one of the many reasons people and organizations did not share as much as expected is the privacy risk associated with data-sharing operations. There have been many works on practical systems to compute statistical queries with Differential Privacy (DP). There have also been practical implementations of systems…
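To make the abstract's notion of "statistical queries with Differential Privacy" concrete, here is a minimal sketch of the standard Laplace mechanism applied to a counting query. The function name `dp_count` and the example data are illustrative, not from the paper; the mechanism itself (sensitivity-1 count plus Laplace(1/ε) noise) is the textbook construction.

```python
import math
import random

def dp_count(values, predicate, epsilon: float) -> float:
    """Answer a counting query with epsilon-DP via the Laplace mechanism.

    A count has sensitivity 1 (adding or removing one record changes the
    result by at most 1), so adding Laplace(1/epsilon) noise yields
    epsilon-differential privacy.
    """
    true_count = sum(1 for v in values if predicate(v))
    # Inverse-CDF sampling from Laplace(0, 1/epsilon).
    u = random.random() - 0.5
    noise = -(1.0 / epsilon) * math.copysign(1.0, u) * math.log(1.0 - 2.0 * abs(u))
    return true_count + noise

# Example: private count of incomes above 50k (true count is 3).
incomes = [32_000, 58_000, 71_000, 45_000, 90_000]
print(dp_count(incomes, lambda x: x > 50_000, epsilon=1.0))
```

Smaller ε means a larger noise scale and stronger privacy; production systems would additionally track the privacy budget across queries.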

Cited by 1 publication (3 citation statements)
References 14 publications
“…For the rest of this paper, we present algorithms as if the data were held centrally, with the understanding that all the operations we use can be performed in the federated model (with rounding to fixed precision). This means that we avoid techniques designed for central evaluation such as the exponential mechanism [33,45,65]. Threat Model: In this work, in common with many other works in the federated setting, we assume an honest-but-curious model, where the clients do not trust others with their raw data.…”
Section: The Federated Model of Computation
Confidence: 99%
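The "rounding to fixed precision" the citing authors mention is what lets real-valued statistics be aggregated in a federated ring (e.g., under secure aggregation). A minimal sketch of such fixed-point encoding follows; the scale, modulus, and function names are assumptions for illustration, not details from either paper.

```python
SCALE = 1 << 16          # fixed-point scale: 16 fractional bits
MODULUS = 1 << 32        # ring size for modular aggregation

def encode(x: float) -> int:
    # Round to fixed precision and map into the ring [0, MODULUS).
    return int(round(x * SCALE)) % MODULUS

def decode(n: int) -> float:
    # Map back from the ring, interpreting the top half as negatives.
    if n >= MODULUS // 2:
        n -= MODULUS
    return n / SCALE

# Clients encode local values; the server sums in the ring and decodes once.
client_values = [0.25, -1.5, 3.125]
aggregate = sum(encode(v) for v in client_values) % MODULUS
print(decode(aggregate))  # → 1.875
```

Because all arithmetic happens modulo a fixed ring size, the same sums can be computed obliviously by a secure-aggregation protocol without revealing individual clients' values.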
“…While this is suitable in non-private settings, it is difficult to calculate such quantiles (or quantile sketches) accurately without incurring an appreciable privacy cost. Existing work on DP-GBDTs has computed split candidates either with LDP quantiles in the local setting [43], DP quantiles in the central setting [33] or with MPC methods (without DP guarantees) in distributed settings [61]. As we assume bounds on features are public knowledge, we do not need to query participants' data, and hence 𝜅 𝑐 = 0.…”
Section: Component 3: Generating Split Candidates
Confidence: 99%