Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) 2018
DOI: 10.18653/v1/p18-1131

Twitter Universal Dependency Parsing for African-American and Mainstream American English

Abstract: Due to the presence of both Twitter-specific conventions and non-standard and dialectal language, Twitter presents a significant parsing challenge to current dependency parsing tools. We broaden English dependency parsing to handle social media English, particularly social media African-American English (AAE), by developing and annotating a new dataset of 500 tweets, 250 of which are in AAE, within the Universal Dependencies 2.0 framework. We describe our standards for handling Twitter- and AAE-specific features…
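For readers unfamiliar with the annotation format, the sketch below shows what a Universal Dependencies 2.0 (CoNLL-U) record looks like and how it can be read in Python. The example sentence and its analysis are hypothetical and follow standard UD conventions (the copula attaching to a nominal predicate, an interjection attached as discourse); they are not drawn from the paper's dataset and do not reflect its Twitter- or AAE-specific annotation decisions.

# Minimal sketch: one tweet-like sentence in CoNLL-U format and a small
# reader for it. The sentence and analysis are illustrative only.
CONLLU_RECORD = """\
# text = that movie was fire lol
1\tthat\tthat\tDET\t_\t_\t2\tdet\t_\t_
2\tmovie\tmovie\tNOUN\t_\t_\t4\tnsubj\t_\t_
3\twas\tbe\tAUX\t_\t_\t4\tcop\t_\t_
4\tfire\tfire\tNOUN\t_\t_\t0\troot\t_\t_
5\tlol\tlol\tINTJ\t_\t_\t4\tdiscourse\t_\t_
"""

def read_conllu(block):
    # Yield (id, form, head, deprel) tuples from one CoNLL-U sentence block,
    # skipping comment and blank lines. HEAD is column 7, DEPREL column 8.
    for line in block.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        yield int(cols[0]), cols[1], int(cols[6]), cols[7]

for tok_id, form, head, deprel in read_conllu(CONLLU_RECORD):
    print(f"{tok_id}\t{form}\thead={head}\tdeprel={deprel}")

Running this prints each token with its head index and dependency relation, which is the information the paper's annotations provide for every tweet token.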

Cited by 27 publications (22 citation statements) | References 17 publications
“…An equally important consideration, in addition to whom the data describes, is who authored the data. For example, Blodgett et al. (2018) show that parsing systems trained on White Mainstream American English perform poorly on African American English (AAE). In a more general example, Wikipedia has become a popular data source for many NLP tasks.…”
Section: NLP Systems Encode Racial Bias (mentioning)
confidence: 99%
“…• Annotation schema: Returning to Blodgett et al. (2018), this work defines new parsing standards for formalisms common in AAE, demonstrating how parsing labels themselves were not designed for racialized language varieties. • Annotation instructions: Sap et al. (2019) show that annotators are less likely to label tweets using AAE as offensive if they are told the likely language varieties of the tweets.…”
Section: NLP Systems Encode Racial Bias (mentioning)
confidence: 99%
“…Table 2 expresses 22 metrics from the literature as instances of our generalized metrics from Section 3. The presented metrics span a number of NLP tasks, including text classification (Dixon et al., 2018; Garg et al., 2019; Borkan et al., 2019; Prabhakaran et al., 2019), relation extraction (Gaut et al., 2020), text generation (Huang et al., 2020a) and dependency parsing (Blodgett et al., 2018). We arrive at this list by reviewing 146 papers that study bias from the survey of Blodgett et al…”
Section: Classifying Existing Fairness Metrics (mentioning)
confidence: 99%
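To make the notion of a generalized group-comparison metric concrete, here is a minimal sketch that scores a parser separately on two language-variety groups and reports the largest gap. The labeled attachment score (LAS) definition is standard; the toy data, function names, and the specific gap formulation are illustrative assumptions, not the formulation used in the cited work.

def las(gold, pred):
    # Labeled attachment score: fraction of tokens whose predicted
    # (head, deprel) pair matches the gold annotation.
    correct = sum(g == p for g, p in zip(gold, pred))
    return correct / len(gold)

def group_gap(groups):
    # Score each group separately, then report the largest difference --
    # one simple instance of a group-comparison fairness metric.
    scores = {name: las(gold, pred) for name, (gold, pred) in groups.items()}
    values = list(scores.values())
    return scores, max(values) - min(values)

# Hypothetical evaluation data: per-token (head, deprel) pairs, gold vs.
# predicted, for tweets grouped by language variety.
groups = {
    "AAE": ([(2, "nsubj"), (0, "root")], [(2, "nsubj"), (1, "obj")]),
    "MAE": ([(2, "nsubj"), (0, "root")], [(2, "nsubj"), (0, "root")]),
}

scores, gap = group_gap(groups)
print(scores)       # per-group LAS
print("gap:", gap)  # 0.5 in this toy example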
“…They evaluated the performance of two parsers on this dataset and found that their performance lagged significantly in comparison to their performance on the Italian UD Treebank. A new dataset of 500 tweets within the framework of UD 2.0 was developed and annotated by [5], of which 250 tweets are in African American English. TWEEBANK V2 was developed by [13] by completely labelling TWEEBANK V1 according to UD 2.0, along with additionally sampled tweets, for a total of 3,550 tweets.…”
Section: Related Work (mentioning)
confidence: 99%