Companion Proceedings of the 2019 World Wide Web Conference 2019
DOI: 10.1145/3308560.3317593
|View full text |Cite
|
Sign up to set email alerts
|

Nuanced Metrics for Measuring Unintended Bias with Real Data for Text Classification

Abstract: Unintended bias in Machine Learning can manifest as systemic differences in performance for different demographic groups, potentially compounding existing challenges to fairness in society at large. In this paper, we introduce a suite of threshold-agnostic metrics that provide a nuanced view of this unintended bias, by considering the various ways that a classifier's score distribution can vary across designated groups. We also introduce a large new test set of online comments with crowd-sourced annotations fo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

2
276
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
2
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 241 publications
(295 citation statements)
references
References 11 publications
2
276
0
Order By: Relevance
“…8 https ://www.bbc.co.uk/news/uk-north ern-irela nd-53386 976. 9 https ://www.thegu ardia n.com/uk-news/2020/may/14/polic e-vow-to-break -up-plann ed-anti-lockd ownprote sts-in-uk-citie s 10 https ://www.teleg raph.co.uk/news/2020/04/20/coron aviru s-world -erupt s-prote st-again st-lockd ownpictu res/ 11 https ://www.washi ngton post.com/world /europ e/face-masks -coron aviru s-uk/2020/07/14/d05df b7c-c5d4-11ea-a825-87220 04e41 50_story .html. 12 https ://www.teleg raph.co.uk/news/2020/04/06/brita ins-hubri stic-scien tific -advis ers-wrong -publi c-shoul d-weari ng/.…”
Section: Dimensions Of Political Discourse In the Ukmentioning
confidence: 99%
See 1 more Smart Citation
“…8 https ://www.bbc.co.uk/news/uk-north ern-irela nd-53386 976. 9 https ://www.thegu ardia n.com/uk-news/2020/may/14/polic e-vow-to-break -up-plann ed-anti-lockd ownprote sts-in-uk-citie s 10 https ://www.teleg raph.co.uk/news/2020/04/20/coron aviru s-world -erupt s-prote st-again st-lockd ownpictu res/ 11 https ://www.washi ngton post.com/world /europ e/face-masks -coron aviru s-uk/2020/07/14/d05df b7c-c5d4-11ea-a825-87220 04e41 50_story .html. 12 https ://www.teleg raph.co.uk/news/2020/04/06/brita ins-hubri stic-scien tific -advis ers-wrong -publi c-shoul d-weari ng/.…”
Section: Dimensions Of Political Discourse In the Ukmentioning
confidence: 99%
“…Data from Kaggle's 2012 challenge, "Detecting Insults in Social Commentary" [5], were used to evaluate the success of the approach, this being in keeping with our definition of abuse (many more recent corpora define this differently, e.g., "toxicity", as in the Jigsaw corpus [9], is much broader). Our approach was shown to have an accuracy of 80%, and a precision/recall/F1 of 0.72/0.47/0.57.…”
Section: Rule-based Identification Of Abusive Languagementioning
confidence: 99%
“…With two protected groups, pinned AUC works by resampling the data such that each of the two groups make up 50% of the data, and then calculating the ROC AUC on the resampled dataset. Based on the wellknown equivalence between ROC AUC and average pairwise accuracy, Borkan et al (2019) demonstrate that pinned AUC, as well as their proposed weighted pinned AUC metric, can be decomposed as a linear combination of within-group and cross-group pairwise accuracies. In other words, both pinned AUC and weighted pinned AUC can be written as linear combinations of different pairwise accuracies A Gi>Gj in (1).…”
Section: Related Workmentioning
confidence: 99%
“…Prior literature on detecting anti-social acts at scale has primarily used supervised machine learning that predominantly relies on content-based features to identify relevant posts (Al-Makhadmeh & Tolba, 2019; Kwok & Wang, 2013;Pitsilis, Ramampiaro, & Langseth, 2018;Gorrell et al, 2019;Borkan et al, 2019;. For example, Dybala et al (2010) used support vector machines (SVM) to classify comments posted on unofficial school websites in Japan into those that are potentially harmful and not.…”
Section: Content-based Approachesmentioning
confidence: 99%