Lipstick on a Pig:

Gonen, Hila; Goldberg, Yoav

doi:10.18653/v1/n19-1061

Cited by 100 publications

(69 citation statements)

References 5 publications

(15 reference statements)


“…Figure 7 shows the result of applying the debiasing technique, highlighting that some of the spurious correlations have indeed been removed. It is important to note though, that the above technique does not reliably remove all relevant bias in the embeddings and that bias is still measurably existing in the embedding space as Gonen and Goldberg (2019) have shown. This can be verified with whatlies, by plotting the neighbours of the biased and debiased space: As the output shows, the neighbourhoods of maid in the biased and debiased space are almost equivalent, with e.g.…”

Section: Emb_of_pairsplot_distance(metric="cosine") (mentioning)

confidence: 99%

Going Beyond T-SNE: Exposing whatlies in Text Embeddings

Warmerdam, Kober, Tatman

2020

Proceedings of Second Workshop for NLP Open Source Software (NLP-OSS)


We introduce whatlies, an open source toolkit for visually inspecting word and sentence embeddings. The project offers a unified and extensible API with current support for a range of popular embedding backends including spaCy, tfhub, huggingface transformers, gensim, fastText and BytePair embeddings. The package combines a domain specific language for vector arithmetic with visualisation tools that make exploring word embeddings more intuitive and concise. It offers support for many popular dimensionality reduction techniques as well as many interactive visualisations that can either be statically exported or shared via Jupyter notebooks. The project documentation is available from https://rasahq.github.io/whatlies/.


“…Fairness research in NLP has seen tremendous growth in the past few years (e.g., (Bolukbasi et al, 2016; Caliskan et al, 2017; Webster et al, 2018; Díaz et al, 2018; Dixon et al, 2018; De-Arteaga et al, 2019; Gonen and Goldberg, 2019; Manzini et al, 2019)) over a range of NLP tasks such as co-reference resolution and machine translation, as well as the tasks we studied - sentiment analysis and toxicity prediction. Some of this work study bias by swapping names in sentence templates (Caliskan et al, 2017; Kiritchenko and Mohammad, 2018; May et al, 2019; Gonen and Goldberg, 2019); however they use synthetic sentence templates, while we extract naturally occurring sentences from the target corpus.…”

Section: Related Work (mentioning)

confidence: 99%

Perturbation Sensitivity Analysis to Detect Unintended Model Biases

Prabhakaran, Hutchinson, Mitchell

2019

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conferen


“…Under their evaluation, they find they can nearly perfectly remove bias in an analogical reasoning task. However, subsequent work (Gonen and Goldberg, 2019; Hall Maudslay et al, 2019) has indicated that gender bias still lingers in the embeddings, despite Bolukbasi et al (2016)'s strong experimental results. In the development of their method, Bolukbasi et al (2016) make a critical and unstated assumption: Gender bias forms a linear subspace of word embedding space.…”

Section: Introduction (mentioning)

confidence: 96%

“…As previously noted, there are now multiple bias removal methodologies (Zhao et al, 2018, 2019; May et al, 2019) that have succeed the method by Bolukbasi et al (2016). Furthermore Gonen and Goldberg (2019) point out multiple flaws in Bolukbasi et al (2016)'s bias mitigation technique and the aforementioned methods. Nonetheless we believe that this method has received sufficient attention from the community such that research into its properties is both interesting and useful.…”

Section: Introduction (mentioning)

confidence: 99%

“…First, we consider the Word Embedding Association Test (WEAT; Caliskan et al, 2017); we notice that across five non-linear kernels and convex combinations thereof, there is seemingly no significant difference between the non-linear bias mitigation technique and the linear one. Secondly, we consider the professions task (Bolukbasi et al, 2016; Gonen and Goldberg, 2019) that measures how word embeddings representing different professions are potentially gender-stereotyped. Again, as with the WEAT evaluation, we find that our non-linear bias mitigation technique performs on par with the linear method.…”

Section: Introduction (mentioning)

confidence: 99%


Exploring the Linear Subspace Hypothesis in Gender Bias Mitigation

Vargas, Cotterell

2020

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)


Bolukbasi et al. (2016) present one of the first gender bias mitigation techniques for word embeddings. Their method takes pre-trained word embeddings as input and attempts to isolate a linear subspace that captures most of the gender bias in the embeddings. As judged by an analogical evaluation task, their method virtually eliminates gender bias in the embeddings. However, an implicit and untested assumption of their method is that the bias subspace is actually linear. In this work, we generalize their method to a kernelized, non-linear version. We take inspiration from kernel principal component analysis and derive a nonlinear bias isolation technique. We discuss and overcome some of the practical drawbacks of our method for non-linear gender bias mitigation in word embeddings and analyze empirically whether the bias subspace is actually linear. Our analysis shows that gender bias is in fact well captured by a linear subspace, justifying the assumption of Bolukbasi et al. (2016).
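
To make the "linear subspace" idea in the abstract above concrete, here is a minimal NumPy sketch of the general approach: estimate a single bias direction from definitional word pairs and project it out of another word vector. It is an illustration under stated assumptions, not the authors' implementation and not the kernelized method the abstract proposes. The toy 3-dimensional vectors and the helper names `bias_direction` and `neutralize` are hypothetical.

```python
import numpy as np

# Toy 3-d embeddings (made-up numbers, for illustration only).
emb = {
    "he":    np.array([ 1.0, 0.0, 0.2]),
    "she":   np.array([-1.0, 0.0, 0.2]),
    "man":   np.array([ 0.9, 0.1, 0.0]),
    "woman": np.array([-0.9, 0.1, 0.0]),
    "nurse": np.array([-0.5, 0.7, 0.3]),
}

def bias_direction(pairs, emb):
    """Estimate a 1-d bias subspace as the top right-singular vector
    of the stacked pair differences (hypothetical helper)."""
    diffs = np.stack([emb[a] - emb[b] for a, b in pairs])
    _, _, vt = np.linalg.svd(diffs, full_matrices=False)
    return vt[0]  # unit-norm direction

def neutralize(v, g):
    """Remove the component of v that lies along direction g."""
    g = g / np.linalg.norm(g)
    return v - np.dot(v, g) * g

g = bias_direction([("he", "she"), ("man", "woman")], emb)
debiased = neutralize(emb["nurse"], g)

# After projection, the vector has no component along the bias direction.
print(abs(np.dot(debiased, g)))
```

A zero projection onto the estimated direction is exactly the property that the citing statements attribute to Gonen and Goldberg (2019) as insufficient: they report that bias remains measurable in the embedding space after this kind of removal.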
