Proceedings of the 4th Workshop on Gender Bias in Natural Language Processing (GeBNLP) 2022
DOI: 10.18653/v1/2022.gebnlp-1.27
Why Knowledge Distillation Amplifies Gender Bias and How to Mitigate from the Perspective of DistilBERT

Abstract: Knowledge distillation is widely used to transfer the language understanding of a large model to a smaller model. However, after knowledge distillation, the smaller model was found to exhibit more gender bias than the source large model. This paper studies what causes gender bias to increase after the knowledge distillation process. Moreover, we suggest applying a variant of mixup on knowledge distillation, which is used to increase generalizability during the distillation process, not for augm…

Cited by 5 publications (6 citation statements)
References 10 publications (15 reference statements)
“…The only exception is Distilled mUSE, where the job-prestige dimension applied to countries still correlates with the country's GDP and the east-west axis. This is consistent with previous work showing that distilled student models exhibit more biases than models trained on authentic data (Vamvas and Sennrich, 2021; Ahn et al., 2022).…”
Section: Results (supporting)
confidence: 93%
“…This result suggests that the models do not connect individual social prestige with the country of origin. The exception is a small model distilled from the Multilingual Universal Sentence Encoder (Yang et al., 2020) that seems to mix these two and thus confirms previous work claiming that distilled models are more prone to biases (Ahn et al., 2022).…”
Section: Introduction (supporting)
confidence: 83%
“…Since such a biased ratio is not favorable, the generative AI software that mimics such a biased ratio is also not favorable. On the other hand, there is also a chance that AI models could generate a more biased ratio [1]. Such imbalanced generations may reinforce the bias or stereotypes.…”
Section: Social Bias (mentioning)
confidence: 99%
“…Image generation models, which generate images from a given text, have recently drawn a lot of interest from academia and the industry. For example, Stable Diffusion [37], an open-sourced latent text-to-image diffusion model, has 60K stars on GitHub. And Midjourney, an AI image generation commercial software product launched in July 2022, has more than 15 million users [13].…”
Section: Introduction (mentioning)
confidence: 99%
“…Finally, while the interplay and tradeoff between privacy, efficiency, and fairness in tabular data has received extensive examination (Hooker et al., 2020; Lyu et al., 2020), comparatively fewer studies have been conducted in NLP (Tal et al., 2022; Ahn et al., 2022; Hessenthaler et al., 2022).…”
Section: Introduction (mentioning)
confidence: 99%