Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining 2022
DOI: 10.1145/3534678.3539161

Proactively Reducing the Hate Intensity of Online Posts via Hate Speech Normalization

Abstract: Although pre-trained large language models (PLMs) have achieved state-of-the-art performance on many NLP tasks, they lack an understanding of subtle expressions of implicit hate speech. Such nuanced and implicit hate is often misclassified as non-hate. Various attempts have been made to enhance the detection of (implicit) hate content by augmenting external context or enforcing label separation via distance-based metrics. We combine these two approaches and introduce FiADD, a novel Focused Inferential Adaptive Density Discri…
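The abstract mentions enforcing label separation via distance-based metrics. As a minimal, hedged sketch of what such a metric can look like (this is a generic triplet-style margin loss, not FiADD's actual objective, which the truncated abstract does not specify), embeddings of same-label posts are pulled together while different-label embeddings are pushed at least a margin apart:

```python
import numpy as np

def triplet_margin_loss(anchor, positive, negative, margin=1.0):
    """Generic distance-based label separation (illustrative only).

    anchor, positive: embeddings of two posts with the same label
    (e.g., both implicit hate); negative: an embedding with a
    different label (e.g., non-hate). The loss is zero once the
    negative is at least `margin` farther from the anchor than
    the positive is.
    """
    d_pos = np.linalg.norm(np.asarray(anchor) - np.asarray(positive))
    d_neg = np.linalg.norm(np.asarray(anchor) - np.asarray(negative))
    return max(0.0, d_pos - d_neg + margin)
```

With a well-separated negative the loss vanishes, so gradient-based training only acts on pairs that violate the margin.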

Cited by 6 publications (3 citation statements). References 76 publications.
“…From a human-computer interaction perspective, research has focused on mitigating uncivil behaviors through platform-level intervention and policy (Chandrasekharan et al 2017; Jhaver et al 2021), encouraging user and community self-moderation (Seering, Kraut, and Dabbish 2017), and by examining how platform and UX design promote desirable or undesirable behavior (Munn 2020; Seering et al 2019). Uncivil speech detection can be formulated as a supervised machine learning and NLP task (Davidson et al 2017), often with the aim of building automated systems for filtering such unwanted content either tailored to specific platforms (Masud et al 2022; Nobata et al 2016) or for general use across contexts (Jigsaw 2021; Lees et al 2022). There is also a significant body of work on measuring and classifying uncivil behavior beyond textual communication in online gaming contexts (Canossa et al 2021; Kou 2020).…”
Section: Related Work On Digital Civility
confidence: 99%
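The citation statement above frames uncivil speech detection as a supervised learning task: label posts, extract text features, and fit a classifier. A minimal sketch under toy assumptions (bag-of-words features with a perceptron; real systems such as those cited use far richer features and models, and the training examples here are invented for illustration):

```python
from collections import Counter

def bag_of_words(text):
    # Lowercased token counts as a sparse feature map.
    return Counter(text.lower().split())

def train_perceptron(examples, epochs=10):
    """examples: list of (text, label), label in {0, 1} (1 = uncivil)."""
    weights = Counter()
    bias = 0.0
    for _ in range(epochs):
        for text, label in examples:
            feats = bag_of_words(text)
            score = bias + sum(weights[w] * c for w, c in feats.items())
            pred = 1 if score > 0 else 0
            if pred != label:
                # Standard perceptron update toward the true label.
                step = 1 if label == 1 else -1
                for w, c in feats.items():
                    weights[w] += step * c
                bias += step
    return weights, bias

def predict(weights, bias, text):
    feats = bag_of_words(text)
    return 1 if bias + sum(weights[w] * c for w, c in feats.items()) > 0 else 0

# Hypothetical toy training data, purely for illustration.
train = [
    ("you are awful trash", 1),
    ("you are wonderful", 0),
    ("go away trash", 1),
    ("have a nice day", 0),
]
weights, bias = train_perceptron(train)
```

The filtering systems cited (platform-specific or general-purpose) follow the same supervised recipe, differing mainly in the labeled corpora and model capacity.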
“…Inspired by our experiments, in partnership with Wipro AI, we are developing an interactive web interface for contextual hate speech detection [34]. This interface will be a part of their more extensive pipeline to flag and analyze harmful content on the web.…”
Section: Content Moderation Pipeline
confidence: 99%
“…• Benchmarking: We benchmark GOTHate against ten diverse and widely-studied baseline methods (Section 6). • Content moderation pipeline: This research has led to the creation of a hate speech detection pipeline currently under development in collaboration with Wipro AI [34, 35] (Section 7). • Reproducibility: The source code and sample dataset are publicly available on our GitHub 2 .…”
Section: Introduction
confidence: 99%