Christine Basta scite author profile

Gender bias is highly impacting natural language processing applications. Word embeddings have clearly been proven both to keep and amplify gender biases that are present in current data sources. Recently, contextualized word embeddings have enhanced previous word embedding techniques by computing word vector representations dependent on the sentence they appear in.In this paper, we study the impact of this conceptual change in the word embedding computation in relation with gender bias. Our analysis includes different measures previously applied in the literature to standard word embeddings. Our findings suggest that contextualized word embeddings are less biased than standard ones even when the latter are debiased.

show abstract

Evaluating the Underlying Gender Bias in Contextualized Word Embeddings

Basta

Costa-jussà

Casas

2019

Preprint

View full text Add to dashboard Cite

Towards Mitigating Gender Bias in a decoder-based Neural Machine Translation model by Adding Contextual Information

Basta¹,

Costa-jussà²,

Fonollosa³

2020

View full text Add to dashboard Cite

Extensive study on the underlying gender bias in contextualized word embeddings

Basta

Costa-jussà

Casas

2020

Neural Comput & Applic

View full text Add to dashboard Cite

The TALP-UPC Machine Translation Systems for WMT19 News Translation Task: Pivoting Techniques for Low Resource MT

Casas¹,

Fonollosa²,

Escolano³

et al. 2019

View full text Add to dashboard Cite

In this article, we describe the TALP-UPC research group participation in the WMT19 news translation shared task for Kazakh-English. Given the low amount of parallel training data, we resort to using Russian as pivot language, training subword-based statistical translation systems for Russian-Kazakh and Russian-English that were then used to create two synthetic pseudo-parallel corpora for Kazakh-English and English-Kazakh respectively. Finally, a self-attention model based on the decoder part of the Transformer architecture was trained on the two pseudoparallel corpora.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Christine Basta

Evaluating the Underlying Gender Bias in Contextualized Word Embeddings

Evaluating the Underlying Gender Bias in Contextualized Word Embeddings

Towards Mitigating Gender Bias in a decoder-based Neural Machine Translation model by Adding Contextual Information

Extensive study on the underlying gender bias in contextualized word embeddings

The TALP-UPC Machine Translation Systems for WMT19 News Translation Task: Pivoting Techniques for Low Resource MT

Contact Info

Product

Resources

About