2019 IEEE Security and Privacy Workshops (SPW)
DOI: 10.1109/spw.2019.00020

Defending Against Neural Network Model Stealing Attacks Using Deceptive Perturbations

Abstract: Machine learning architectures are readily available on the web, but creating high-quality training data is costly. However, a pretrained model on a cloud service can be used to generate labeled data to steal the model if the adversary can obtain output labels for chosen inputs. To protect against these attacks, it has been proposed to limit the information provided to the adversary by omitting probability scores, significantly impacting the utility of the provided service. In this work, we illustrate how …
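To make the threat model concrete, the extraction loop the abstract describes can be sketched as follows. This is our own minimal illustration, not code from the paper: scikit-learn models stand in for both the cloud-hosted victim and the adversary's surrogate, and all names are hypothetical.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.neural_network import MLPClassifier

rng = np.random.default_rng(0)

# Victim: stands in for the pretrained model behind the cloud API.
X, y = make_classification(n_samples=2000, n_features=20, random_state=0)
victim = MLPClassifier(hidden_layer_sizes=(64,), max_iter=500,
                       random_state=0).fit(X, y)

# Adversary: picks query inputs and observes only the returned labels.
queries = rng.normal(size=(1000, 20))
stolen_labels = victim.predict(queries)        # label-only access

# Surrogate trained on the stolen (query, label) pairs.
surrogate = LogisticRegression(max_iter=1000).fit(queries, stolen_labels)

# Agreement with the victim on fresh inputs approximates extraction success.
test = rng.normal(size=(500, 20))
agreement = (surrogate.predict(test) == victim.predict(test)).mean()
print(f"victim/surrogate agreement: {agreement:.2%}")
```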

Cited by 78 publications (57 citation statements)
References 11 publications
“…A first defense against model extraction is to reduce the amount of information given to an adversary by modifying the model prediction. Prediction probabilities can be quantized [55] or perturbed to deceive the adversary [29]. We have shown that model extraction attacks are effective even without using prediction probabilities (Sect.…”
Section: B. Defenses Against Model Extraction
confidence: 99%
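The quantization defense cited above as [55] can be pictured as rounding each returned probability onto a coarse grid before the response leaves the API. A minimal sketch under that reading (our illustration, not either cited paper's code):

```python
import numpy as np

def quantize_probs(probs: np.ndarray, levels: int = 10) -> np.ndarray:
    """Round each class probability to one of `levels` bins, then
    renormalize so the response still sums to 1. Coarser grids leak
    less information per query."""
    q = np.round(np.asarray(probs, dtype=float) * levels) / levels
    total = q.sum()
    return q / total if total > 0 else q

print(quantize_probs(np.array([0.6183, 0.2514, 0.1303]), levels=10))
# -> approximately [0.6, 0.3, 0.1]
```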
“…Tramer et al [15] and Lee et al [100] suggested that the efficiency of a model extraction attack can be decreased by omitting the confidence values or by adding smart noise to the predicted probabilities. However, Juuti et al [101] showed that model extraction is effective even when prediction probabilities are omitted.…”
Section: Miscellaneous Defense
confidence: 99%
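As a generic illustration of the "smart noise" idea (our sketch, not the exact mechanism of [15] or [100]): perturb the returned probability vector, but resample whenever the noise would flip the top-1 label, so honest users still receive the correct classification.

```python
import numpy as np

def noisy_probs(probs, scale=0.05, rng=None):
    """Add random noise to the prediction vector, rejecting samples
    that would change the argmax, so plain classification accuracy
    for honest users is preserved."""
    rng = rng or np.random.default_rng()
    probs = np.asarray(probs, dtype=float)
    top = int(np.argmax(probs))
    while True:
        noisy = probs + rng.normal(scale=scale, size=probs.shape)
        noisy = np.clip(noisy, 1e-6, None)
        noisy /= noisy.sum()
        if int(np.argmax(noisy)) == top:
            return noisy

print(noisy_probs([0.7, 0.2, 0.1]))
```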
“…Confidence rounding and ensemble models were shown to be effective against equation-solving extraction in [1]. Lee et al [27] proposed a reverse-sigmoid mechanism that injects deceptive noise into the output confidences while preserving the validity of the top- and bottom-ranked labels. Kesarwani et al [6] monitored user-server query streams to evaluate the threat level of model extraction, using two strategies based on entropy and on compact model summaries.…”
Section: Model Extraction
confidence: 99%
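A sketch of a reverse-sigmoid style perturbation in the spirit of Lee et al [27]: the formula and the hyperparameters beta and gamma below follow the commonly quoted form rather than the paper verbatim, so treat it as an approximation, not the paper's exact defense.

```python
import numpy as np

def reverse_sigmoid_perturb(probs, beta=0.7, gamma=0.2):
    """Shift each probability y by r(y) = beta * (sigmoid(gamma *
    logit(y)) - 0.5), our reading of the commonly quoted reverse-sigmoid
    form. Mid-range confidences are distorted the most, while very high
    and very low scores keep their rank positions."""
    y = np.clip(np.asarray(probs, dtype=float), 1e-7, 1 - 1e-7)
    logit = np.log(y / (1.0 - y))
    r = beta * (1.0 / (1.0 + np.exp(-gamma * logit)) - 0.5)
    perturbed = np.clip(y - r, 0.0, 1.0)
    return perturbed / perturbed.sum()   # renormalize to a distribution

print(reverse_sigmoid_perturb(np.array([0.75, 0.20, 0.05])))
```

Because the shift r(y) is small and monotone for hyperparameters in this range, the highest- and lowest-ranked labels keep their positions, matching the "preserved validity of top and bottom rank labels" property quoted above.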