Type B Reflexivization as an Unambiguous Testbed for Multilingual Multi-Task Gender Bias

Gonzalez, Ana Valeria; Barrett, Maria; Hvingelby, Rasmus; Webster, Kellie; Søgaard, Anders

doi:10.18653/v1/2020.emnlp-main.209

Cited by 15 publications

(14 citation statements)

References 22 publications

(22 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Several works created synthetic datasets to evaluate gender bias (Kiritchenko and Mohammad, 2018;González et al, 2020;Renduchintala and Williams, 2021), e.g., in the context of coreference (Rudinger et al, 2017;Zhao et al, 2018) and machine translation (Stanovsky et al, 2019;Prates et al, 2019;Kocmi et al, 2020), and some works used synthetic datasets to debias models (Saunders et al, 2020;Zhao et al, 2018). Webster et al (2018) and Gonen and Webster (2020), collected natural medium-scale (4.4K sentences) datasets from Wikipedia and reddit, re-spectively, and use them to evaluate gender bias in models of coreference resolution and machine translation.…”

Section: Related Workmentioning

confidence: 99%

Collecting a Large-Scale Gender Bias Dataset for Coreference Resolution and Machine Translation

Levy¹,

Lazar²,

Stanovsky³

2021

Findings of the Association for Computational Linguistics: EMNLP 2021

View full text Add to dashboard Cite

Recent works have found evidence of gender bias in models of machine translation and coreference resolution using mostly synthetic diagnostic datasets. While these quantify bias in a controlled experiment, they often do so on a small scale and consist mostly of artificial, out-of-distribution sentences. In this work, we find grammatical patterns indicating stereotypical and non-stereotypical gender-role assignments (e.g., female nurses versus male dancers) in corpora from three domains, resulting in a first large-scale gender bias dataset of 108K diverse real-world English sentences. We manually verify the quality of our corpus and use it to evaluate gender bias in various coreference resolution and machine translation models. We find that all tested models tend to over-rely on gender stereotypes when presented with natural inputs, which may be especially harmful when deployed in commercial systems. Finally, we show that our dataset lends itself to finetuning a coreference resolution model, finding it mitigates bias on a held out set. Our dataset and models are publicly available at github.com/ SLAB-NLP/BUG. We hope they will spur future research into gender bias evaluation mitigation techniques in realistic settings.

show abstract

Section: Related Workmentioning

confidence: 99%

Collecting a Large-Scale Gender Bias Dataset for Coreference Resolution and Machine Translation

Levy¹,

Lazar²,

Stanovsky³

2021

Findings of the Association for Computational Linguistics: EMNLP 2021

View full text Add to dashboard Cite

show abstract

“…Danish has gendered possessive pronouns, but nongendered reflexive pronouns. This has made it useful as an unambiguous testbed for gender bias in natural language inference models, machine translation models, and language models (González et al, 2020). But automatic coreference resolution for Danish has received no attention, and there was no established evaluation set for this task.…”

Section: Related Workmentioning

confidence: 99%

Resources and Evaluations for Danish Entity Resolution

Barrett¹,

Lam²,

Wu³

et al. 2021

Proceedings of the Fourth Workshop on Computational Models of Reference, Anaphora and Coreference

Self Cite

View full text Add to dashboard Cite

Automatic coreference resolution is understudied in Danish even though most of the Danish Dependency Treebank (Buch-Kromann, 2003) is annotated with coreference relations. This paper describes a conversion of its partial, yet well-documented, coreference relations into coreference clusters and the training and evaluation of coreference models on this data. To the best of our knowledge, these are the first publicly available neural coreference models for Danish. We also present a new entity linking annotation on the dataset using Wiki-Data identifiers, a named entity disambiguation (NED) dataset, and a larger automatically created NED dataset enabling wikily supervised NED models. The entity linking annotation is benchmarked using a state-of-the-art neural entity disambiguation model.

show abstract

“…With the rise in the economy comes more slots in schools, but still, the money is needed, and with the economy rising, some families can't afford to pay the fees for all their children. Reports show that even though most families can send their children to elementary, primary and junior high school, they cannot send them to tertiary schools (González, et al, 2020). Another factor affecting gender inequality in education is the number of children in the household.…”

Section: Rural Areas and Gender Bias In Educationmentioning

confidence: 99%

Gender Inequality in Education in China

Li¹

2021

Advances in Social Science, Education and Humanities Research

View full text Add to dashboard Cite

This paper investigates gender inequality in China. It includes, more specifically, research into gender inequality differences between rural and urban areas in order to compare them. In addition, this paper focuses on gender inequality in educational opportunities in China. Finally, the study delves into how gender inequality in China has evolved over time using reliable graphs and research, and discusses how educational opportunities for women and men in China have changed over the decades.

show abstract

Type B Reflexivization as an Unambiguous Testbed for Multilingual Multi-Task Gender Bias

Cited by 15 publications

References 22 publications

Collecting a Large-Scale Gender Bias Dataset for Coreference Resolution and Machine Translation

Collecting a Large-Scale Gender Bias Dataset for Coreference Resolution and Machine Translation

Resources and Evaluations for Danish Entity Resolution

Gender Inequality in Education in China

Contact Info

Product

Resources

About