2020
DOI: 10.1609/aaai.v34i04.5888
|View full text |Cite
|
Sign up to set email alerts
|

Robustness Certificates for Sparse Adversarial Attacks by Randomized Ablation

Abstract: Recently, techniques have been developed to provably guarantee the robustness of a classifier to adversarial perturbations of bounded L1 and L2 magnitudes by using randomized smoothing: the robust classification is a consensus of base classifications on randomly noised samples where the noise is additive. In this paper, we extend this technique to the L0 threat model. We propose an efficient and certifiably robust defense against sparse adversarial attacks by randomly ablating input features, rather than using… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
113
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 69 publications
(113 citation statements)
references
References 8 publications
0
113
0
Order By: Relevance
“…Graph is essentially binary data, i.e., a pair of nodes can be either connected or unconnected. For binary data, a randomized smoothing method called randomized subsampling [27] achieves state-of-the-art certified robustness. Therefore, we design our certified defense based on randomized subsampling.…”
Section: Overviewmentioning
confidence: 99%
See 3 more Smart Citations
“…Graph is essentially binary data, i.e., a pair of nodes can be either connected or unconnected. For binary data, a randomized smoothing method called randomized subsampling [27] achieves state-of-the-art certified robustness. Therefore, we design our certified defense based on randomized subsampling.…”
Section: Overviewmentioning
confidence: 99%
“…Randomized smoothing: Randomized smoothing [4,10,19,24,25,27,28,32] is state-of-the-art technique to build provably robust machine learning. Compared with other certified defense mechanisms, randomized smoothing has two key advantages: 1) scalable to large neural networks, and 2) applicable to arbitrary classifiers.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…Our defense strategy consists of two parts: image ablation and certificate retraining crowd counting models. The first step is inspired by the recent advance in image classifier certification [23]. Specifically, randomized ablation is effective against APAM attacks because the ablation results of normal image 𝑥 and adversarially perturbed image x are likely to be same (e.g., retaining 45 pixels for each images in Fig.…”
Section: Introductionmentioning
confidence: 99%