2021
DOI: 10.48550/arxiv.2111.07864
Preprint
Evaluating Metrics for Bias in Word Embeddings

Abstract: Over the last years, word and sentence embeddings have become established as text preprocessing for all kinds of NLP tasks and have improved performance significantly. Unfortunately, it has also been shown that these embeddings inherit various kinds of biases from the training data and thereby pass on biases present in society to NLP solutions. Many papers have attempted to quantify bias in word or sentence embeddings to evaluate debiasing methods or compare different embedding models, usually with cosine-based metrics. H…
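The cosine-based bias metrics the abstract refers to typically compare a target word's average cosine similarity to two attribute word sets. A minimal sketch of such an association score, using toy vectors rather than real embeddings (all vectors and word sets here are hypothetical, for illustration only):

```python
import numpy as np

def cosine(u, v):
    """Cosine similarity between two vectors."""
    return float(np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v)))

def association(w, attrs_a, attrs_b):
    """WEAT-style association: mean similarity to set A minus mean similarity to set B."""
    return (np.mean([cosine(w, a) for a in attrs_a])
            - np.mean([cosine(w, b) for b in attrs_b]))

# Toy 2-d "embeddings" (hypothetical, not from any trained model).
career = np.array([1.0, 0.1])
attrs_a = [np.array([0.9, 0.2])]   # stand-in attribute set A
attrs_b = [np.array([0.1, 0.9])]   # stand-in attribute set B

score = association(career, attrs_a, attrs_b)
```

A positive score indicates the target word sits closer (in cosine terms) to attribute set A than to set B; metrics like WEAT aggregate such per-word associations over word lists and normalize them.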

Cited by 2 publications (2 citation statements)
References 13 publications
“…Specifically, a study conducted by [25] quantifies the degree to which gender bias differs with the corpora used for the pre-trained model and fine-tuning with additional data. Schroder et al [26] propose various metrics of embedding bias and compare their strengths and weaknesses. Garrido et al [27] explore biases in embeddings through the lens of geometric spaces, providing a fresh perspective on the subject.…”
Section: Embedding Association Test of Bias
confidence: 99%
“…In this paper, we aim to bridge this gap by studying the performance implications of using such unrepresentative, inorganic corpora (produced by template-based translation or automatic bot creation) through intrinsic evaluation of two main NLP upstream tasks: word representation and language modeling, using word-analogy and fill-mask evaluations, respectively, to capture syntactic and semantic relations between words. We purposely choose these intrinsic evaluations over extrinsic evaluations such as text classification or machine translation because many studies have shown that extrinsic and intrinsic evaluation results are not consistently correlated, and the performance of NLP downstream tasks is always task-specific and can be significantly influenced by fine-tuning procedures (Faruqui et al, 2016; Schröder et al, 2021; Cao et al, 2022). We believe that evaluating NLP upstream tasks intrinsically will give us useful insights into the quality of the Arabic Wikipedia editions' corpora and show how the quality of corpora affects the performance of these NLP tasks.…”
Section: Introduction
confidence: 99%
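The word-analogy evaluation mentioned in the quoted passage is commonly implemented via vector arithmetic (the 3CosAdd method): for an analogy a : b :: c : ?, find the vocabulary word whose vector is most cosine-similar to b - a + c, excluding the query words. A minimal sketch with a toy vocabulary (all vectors here are hypothetical, for illustration only):

```python
import numpy as np

def solve_analogy(a, b, c, vocab):
    """3CosAdd: return the word d maximizing cos(vocab[d], b - a + c), excluding a, b, c."""
    target = vocab[b] - vocab[a] + vocab[c]
    best, best_sim = None, -np.inf
    for word, vec in vocab.items():
        if word in (a, b, c):
            continue  # the query words themselves are never valid answers
        sim = np.dot(vec, target) / (np.linalg.norm(vec) * np.linalg.norm(target))
        if sim > best_sim:
            best, best_sim = word, sim
    return best

# Toy 2-d vocabulary (hypothetical vectors, not from any trained model).
vocab = {
    "king":  np.array([0.9, 0.8]),
    "man":   np.array([0.8, 0.2]),
    "woman": np.array([0.2, 0.3]),
    "queen": np.array([0.3, 0.9]),
}

answer = solve_analogy("man", "king", "woman", vocab)  # analogy man : king :: woman : ?
```

An analogy benchmark then reports the fraction of analogy questions for which the predicted word matches the gold answer, which is what makes it an intrinsic evaluation of the embedding space itself.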