2021
DOI: 10.48550/arxiv.2106.08680
Preprint
Evaluating Gender Bias in Hindi-English Machine Translation

Abstract: With language models being deployed increasingly in the real world, it is essential to address the issue of the fairness of their outputs. The word embedding representations of these language models often implicitly draw unwanted associations that form a social bias within the model. The nature of gendered languages like Hindi poses an additional problem to the quantification and mitigation of bias, owing to the change in the form of the words in the sentence, based on the gender of the subject. Additionally, …

Cited by 1 publication (1 citation statement)
References 2 publications (2 reference statements)
“…This is especially troubling for India, a pluralistic nation of 1.4 billion people, with fast-growing investments in NLP from the government and the private sector. There is commendable recent work on NLP fairness in Indian languages like Hindi, Bengali, and Telugu (Pujari et al., 2019; Malik et al., 2021; Gupta et al., 2021). But, for a nation with many religions, ethnicities, and cultures, recontextualizing NLP fairness needs to account for the various axes of social disparities in Indian society, their proxies in language data, the disparate NLP capabilities in Indian languages, and the (lack of) resources for bias evaluation.…”
Section: Introduction
Confidence: 99%