2020
DOI: 10.1609/aaai.v34i05.6267
|View full text |Cite
|
Sign up to set email alerts
|

On Measuring and Mitigating Biased Inferences of Word Embeddings

Abstract: Word embeddings carry stereotypical connotations from the text they are trained on, which can lead to invalid inferences in downstream models that rely on them. We use this observation to design a mechanism for measuring stereotypes using the task of natural language inference. We demonstrate a reduction in invalid inferences via bias mitigation strategies on static word embeddings (GloVe). Further, we show that for gender bias, these techniques extend to contextualized embeddings when applied selectively only… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

1
156
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
1
1

Relationship

1
5

Authors

Journals

citations
Cited by 88 publications
(174 citation statements)
references
References 17 publications
1
156
0
Order By: Relevance
“…Arguably this setting is more natural, as it better aligns with how systems are used in real life. Several notable examples are coreference resolution (Rudinger et al, 2018;Zhao et al, 2018;Kurita et al, 2019), machine translation (Stanovsky et al, 2019;Cho et al, 2019), textual entailment (Dev et al, 2020a), language generation (Sheng et al, 2019), or clinical classification (Zhang et al, 2020).…”
Section: Related Workmentioning
confidence: 99%
See 4 more Smart Citations
“…Arguably this setting is more natural, as it better aligns with how systems are used in real life. Several notable examples are coreference resolution (Rudinger et al, 2018;Zhao et al, 2018;Kurita et al, 2019), machine translation (Stanovsky et al, 2019;Cho et al, 2019), textual entailment (Dev et al, 2020a), language generation (Sheng et al, 2019), or clinical classification (Zhang et al, 2020).…”
Section: Related Workmentioning
confidence: 99%
“…Such studies on model bias have led to many bias mitigation techniques (e.g., Bolukbasi et al, 2016b;Dev et al, 2020a;Ravfogel et al, 2020;Dev et al, 2020b). In this work, we focus on exploring biases across QA models and expect that our framework could also help future efforts on bias mitigation.…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations