Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 2021
DOI: 10.18653/v1/2021.findings-acl.294
|View full text |Cite
|
Sign up to set email alerts
|

Does Robustness Improve Fairness? Approaching Fairness with Word Substitution Robustness Methods for Text Classification

Abstract: Existing bias mitigation methods to reduce disparities in model outcomes across cohorts have focused on data augmentation, debiasing model embeddings, or adding fairness-based optimization objectives during training. Separately, certified word substitution robustness methods have been developed to decrease the impact of spurious features and synonym substitutions on model predictions. While their end goals are different, they both aim to encourage models to make the same prediction for certain changes in the i… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
16
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
2
1

Relationship

1
8

Authors

Journals

citations
Cited by 13 publications
(16 citation statements)
references
References 38 publications
0
16
0
Order By: Relevance
“…where ∆ TPR (∆ FPR ) is the true positive rate (false negative rate) for sensitive attribute a and TPR overall (FPR overall ) is the overall true positive rate (false negative rate). Following (Pruksachatkun et al, 2021), we define equalized odds gap ∆ EO = ∆ TPR + ∆ FPR , since equalized odds aligns with ∆ TPR + ∆ FPR , and when it is satisfied, ∆ TPR = ∆ FPR = 0 (Borkan et al, 2019). Note that when |Y| > 2, ∆ EO will be summed over each value in Y since TPR and FPR are defined over each class (i.e., ∆ EO = y ∆ y EO ).…”
Section: Methodsmentioning
confidence: 99%
“…where ∆ TPR (∆ FPR ) is the true positive rate (false negative rate) for sensitive attribute a and TPR overall (FPR overall ) is the overall true positive rate (false negative rate). Following (Pruksachatkun et al, 2021), we define equalized odds gap ∆ EO = ∆ TPR + ∆ FPR , since equalized odds aligns with ∆ TPR + ∆ FPR , and when it is satisfied, ∆ TPR = ∆ FPR = 0 (Borkan et al, 2019). Note that when |Y| > 2, ∆ EO will be summed over each value in Y since TPR and FPR are defined over each class (i.e., ∆ EO = y ∆ y EO ).…”
Section: Methodsmentioning
confidence: 99%
“…This task is more difficult within the NLP domain due to the discrete nature of text, but several works (Alzantot et al, 2018;Zhang et al, 2020) have proven successful at inducing model errors. Real-world use of NLP requires resilience to such attacks and our work complements robust training (Parvez et al, 2018) and robust certification (Ye et al, 2020;Pruksachatkun et al, 2021) to produce more reliable models.…”
Section: Related Workmentioning
confidence: 99%
“…And the relevant keywords which categorise the transformation. t/evaluation data splits allows for testing the robustness of models and for identifying possible biases; on the other hand, applying transformations and filters to training data (data augmentation) allows for possibly mitigating the detected robustness and bias issues (Wang et al, 2021b;Pruksachatkun et al, 2021;Si et al, 2021).…”
Section: Format Of a Transformationmentioning
confidence: 99%
“…The resultant tokens are also assigned new tags. Exploiting this transformation has shown to empirically benefit named entity tagging (Yaseen and Langer, 2021) and hence could arguably benefit other lowresource tagging tasks (Bhatt and Dhole, 2020;Khachatrian et al, 2019;Gupta et al, 2021).…”
Section: B9 Backtranslation For Named Entity Recognitionmentioning
confidence: 99%