Findings of the Association for Computational Linguistics: EMNLP 2022 2022
DOI: 10.18653/v1/2022.findings-emnlp.47
|View full text |Cite
|
Sign up to set email alerts
|

Expose Backdoors on the Way: A Feature-Based Efficient Defense against Textual Backdoor Attacks

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
0
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 0 publications
0
0
0
Order By: Relevance
“…The intuition is that the representations of the poisoned samples should be dissimilar to those of the clean ones. Regarding test-stage defences, one can leverage either the victim model (Gao et al, 2019;Yang et al, 2021;Chen et al, 2022b) or an external model (Qi et al, 2021a) to filter out the malicious inputs according to their misbehaviour. Our approach belongs to the family of training-stage defences.…”
Section: Related Workmentioning
confidence: 99%
“…The intuition is that the representations of the poisoned samples should be dissimilar to those of the clean ones. Regarding test-stage defences, one can leverage either the victim model (Gao et al, 2019;Yang et al, 2021;Chen et al, 2022b) or an external model (Qi et al, 2021a) to filter out the malicious inputs according to their misbehaviour. Our approach belongs to the family of training-stage defences.…”
Section: Related Workmentioning
confidence: 99%