Nuanced Metrics for Measuring Unintended Bias with Real Data for Text Classification

Borkan, Daniel; Dixon, Lucas; Sorensen, Jeffrey; Thain, Nithum; Vasserman, Lucy

doi:10.1145/3308560.3317593

Cited by 241 publications

(295 citation statements)

References 11 publications

Supporting

Mentioning

276

Contrasting

Order By: Relevance

“…8 https ://www.bbc.co.uk/news/uk-north ern-irela nd-53386 976. 9 https ://www.thegu ardia n.com/uk-news/2020/may/14/polic e-vow-to-break -up-plann ed-anti-lockd ownprote sts-in-uk-citie s 10 https ://www.teleg raph.co.uk/news/2020/04/20/coron aviru s-world -erupt s-prote st-again st-lockd ownpictu res/ 11 https ://www.washi ngton post.com/world /europ e/face-masks -coron aviru s-uk/2020/07/14/d05df b7c-c5d4-11ea-a825-87220 04e41 50_story .html. 12 https ://www.teleg raph.co.uk/news/2020/04/06/brita ins-hubri stic-scien tific -advis ers-wrong -publi c-shoul d-weari ng/.…”

Section: Dimensions Of Political Discourse In the Ukmentioning

confidence: 99%

“…Data from Kaggle's 2012 challenge, "Detecting Insults in Social Commentary" [5], were used to evaluate the success of the approach, this being in keeping with our definition of abuse (many more recent corpora define this differently, e.g., "toxicity", as in the Jigsaw corpus [9], is much broader). Our approach was shown to have an accuracy of 80%, and a precision/recall/F1 of 0.72/0.47/0.57.…”

Section: Rule-based Identification Of Abusive Languagementioning

confidence: 99%

See 1 more Smart Citation

Vindication, virtue, and vitriol

2020

View full text Add to dashboard Cite

COVID-19 has given rise to a lot of malicious content online, including hate speech, online abuse, and misinformation. British MPs have also received abuse and hate on social media during this time. To understand and contextualise the level of abuse MPs receive, we consider how ministers use social media to communicate about the pandemic, and the citizen engagement that this generates. The focus of the paper is on a large-scale, mixed-methods study of abusive and antagonistic responses to UK politicians on Twitter, during the pandemic from early February to late May 2020. We find that pressing subjects such as financial concerns attract high levels of engagement, but not necessarily abusive dialogue. Rather, criticising authorities appears to attract higher levels of abuse during this period of the pandemic. In addition, communicating about subjects like racism and inequality may result in accusations of virtue signalling or pandering by some users. This work contributes to the wider understanding of abusive language online, in particular that which is directed at public officials.

show abstract

Section: Dimensions Of Political Discourse In the Ukmentioning

confidence: 99%

Section: Rule-based Identification Of Abusive Languagementioning

confidence: 99%

Vindication, virtue, and vitriol

2020

View full text Add to dashboard Cite

show abstract

“…With two protected groups, pinned AUC works by resampling the data such that each of the two groups make up 50% of the data, and then calculating the ROC AUC on the resampled dataset. Based on the wellknown equivalence between ROC AUC and average pairwise accuracy, Borkan et al (2019) demonstrate that pinned AUC, as well as their proposed weighted pinned AUC metric, can be decomposed as a linear combination of within-group and cross-group pairwise accuracies. In other words, both pinned AUC and weighted pinned AUC can be written as linear combinations of different pairwise accuracies A Gi>Gj in (1).…”

Section: Related Workmentioning

confidence: 99%

Pairwise Fairness for Ranking and Regression

Narasimhan

Cotter

Gupta

et al. 2020

AAAI

View full text Add to dashboard Cite

We present pairwise fairness metrics for ranking models and regression models that form analogues of statistical fairness notions such as equal opportunity, equal accuracy, and statistical parity. Our pairwise formulation supports both discrete protected groups, and continuous protected attributes. We show that the resulting training problems can be efficiently and effectively solved using existing constrained optimization and robust optimization techniques developed for fair classification. Experiments illustrate the broad applicability and trade-offs of these methods.

show abstract

“…Prior literature on detecting anti-social acts at scale has primarily used supervised machine learning that predominantly relies on content-based features to identify relevant posts (Al-Makhadmeh & Tolba, 2019; Kwok & Wang, 2013;Pitsilis, Ramampiaro, & Langseth, 2018;Gorrell et al, 2019;Borkan et al, 2019;. For example, Dybala et al (2010) used support vector machines (SVM) to classify comments posted on unofficial school websites in Japan into those that are potentially harmful and not.…”

Section: Content-based Approachesmentioning

confidence: 99%

Studying Anti-Social Behaviour on Reddit with Communalytic

Gruzd¹,

Mai²,

Vahedi³

2020

Preprint

View full text Add to dashboard Cite

The chapter presents a new social media research tool for studying subreddits (i.e., groups) on Reddit called Communalytic. It is an easy-to-use, web-based tool that can collect, analyze and visualize publicly available data from Reddit. In addition to collecting data, Communalytic can assess the toxicity of Reddit posts and replies using a machine learning API. The resulting anti-social scores from the toxicity analysis are then added as weights to each tie in a "who replies to whom" communication network, allowing researchers to visually identify and study toxic exchanges happening within a subreddit. The chapter consists of two parts: first, it introduces our methodology and Communalytic’s main functionalities. Second, it presents a case study of a public subreddit called r/metacanada. This subreddit, popular among the Canadian alt-right, was selected due to its polarizing nature. The case study demonstrates how Communalytic can support researchers studying toxicity in online communities. Specifically, by having access to this additional layer of information about the nature of the communication ties among group members, we were able to provide a more nuanced description of the group dynamics.

show abstract

Nuanced Metrics for Measuring Unintended Bias with Real Data for Text Classification

Cited by 241 publications

References 11 publications

Vindication, virtue, and vitriol

Vindication, virtue, and vitriol

Pairwise Fairness for Ranking and Regression

Studying Anti-Social Behaviour on Reddit with Communalytic

Contact Info

Product

Resources

About