On Measuring and Mitigating Biased Inferences of Word Embeddings

Dev, Sunipa; Li, Tao; Phillips, Jeff M.; Srikumar, Vivek

doi:10.1609/aaai.v34i05.6267

Cited by 88 publications

(174 citation statements)

References 17 publications

Supporting

Mentioning

156

Contrasting


“…Arguably this setting is more natural, as it better aligns with how systems are used in real life. Several notable examples are coreference resolution (Rudinger et al, 2018;Zhao et al, 2018;Kurita et al, 2019), machine translation (Stanovsky et al, 2019;Cho et al, 2019), textual entailment (Dev et al, 2020a), language generation (Sheng et al, 2019), or clinical classification (Zhang et al, 2020).…”

Section: Related Work (mentioning)

confidence: 99%

“…Such studies on model bias have led to many bias mitigation techniques (e.g., Bolukbasi et al, 2016b;Dev et al, 2020a;Ravfogel et al, 2020;Dev et al, 2020b). In this work, we focus on exploring biases across QA models and expect that our framework could also help future efforts on bias mitigation.…”

Section: Related Work (mentioning)

confidence: 99%

“…We define templates (T ) for all four bias classes, and select common names, nationalities, ethnicities, and religions for our subject list (X). We use the occupations from Dev et al (2020a) and statements that capture prejudices from StereoSet (Nadeem et al, 2020) to create our attribute list (A). Table 1 shows the sizes of slot-fillers in our templates and the resulted data sizes.…”

Section: Dataset Generation (mentioning)

confidence: 99%

“…Unfortunately, these representations learn stereotypes often enmeshed in the massive body of text used to train them (Sun et al, 2019). These biases are subsequently passed on to downstream tasks such as co-reference resolution (Rudinger et al, 2018;Zhao et al, 2018), textual entailment (Dev et al, 2020a), and translation (Stanovsky et al, 2019). Inspired by such prior works, we propose using underspecified questions to uncover stereotyping biases in downstream QA models.…”

Section: Introduction (mentioning)

confidence: 99%

“…Note that prior approaches have often focused on discovering biases by recognizing when a model is categorically incorrect (Stanovsky et al, 2019;Dev et al, 2020a;Nadeem et al, 2020). Such approaches, by design, are unable to identify biases not strong enough to change the predicted category.…”

Section: Introduction (mentioning)

confidence: 99%


UNQOVERing Stereotyping Biases via Underspecified Questions

Tao, Khashabi, Khot et al. 2020

Findings of the Association for Computational Linguistics: EMNLP 2020

Self Cite


Warning: This paper contains examples of stereotypes that are potentially offensive. While language embeddings have been shown to have stereotyping biases, how these biases affect downstream question answering (QA) models remains unexplored. We present UNQOVER, a general framework to probe and quantify biases through underspecified questions. We show that a naïve use of model scores can lead to incorrect bias estimates due to two forms of reasoning errors: positional dependence and question independence. We design a formalism that isolates the aforementioned errors. As case studies, we use this metric to analyze four important classes of stereotypes: gender, nationality, ethnicity, and religion. We probe five transformer-based QA models trained on two QA datasets, along with their underlying language models. Our broad study reveals that (1) all these models, with and without fine-tuning, have notable stereotyping biases in these classes; (2) larger models often have higher bias; and (3) the effect of fine-tuning on bias varies strongly with the dataset and the model size.

Measurement and Mitigation of Bias in Artificial Intelligence: A Narrative Literature Review for Regulatory Science

Gray, Samala, Liu et al. 2023

Clin Pharma and Therapeutics


Artificial intelligence (AI) is increasingly being used in decision making across various industries, including the public health arena. Bias in any decision‐making process can significantly skew outcomes, and AI systems have been shown to exhibit biases at times. The potential for AI systems to perpetuate and even amplify biases is a growing concern. Bias, as used in this paper, refers to the tendency toward a particular characteristic or behavior, and thus, a biased AI system is one that shows biased associations between entities. In this literature review, we examine the current state of research on AI bias, including its sources, as well as the methods for measuring, benchmarking, and mitigating it. We also examine the biases and methods of mitigation specifically relevant to the healthcare field and offer a perspective on bias measurement and mitigation in regulatory science decision making.


Keyword Recommendation for Fair Search

Mishra, Soundarajan 2022

Communications in Computer and Information Science
