2023
DOI: 10.1145/3603399
|View full text |Cite
|
Sign up to set email alerts
|

Detecting Harmful Content on Online Platforms: What Platforms Need vs. Where Research Efforts Go

Abstract: The proliferation of harmful content on online platforms is a major societal problem, which comes in many different forms including hate speech, offensive language, bullying and harassment, misinformation, spam, violence, graphic content, sexual abuse, self harm, and many other. Online platforms seek to moderate such content to limit societal harm, to comply with legislation, and to create a more inclusive environment for their users. Researchers have developed different methods for automatically detecting har… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
2
1

Relationship

1
6

Authors

Journals

citations
Cited by 9 publications
(4 citation statements)
references
References 78 publications
0
4
0
Order By: Relevance
“…Strategies and resources have also been put forward for identifying threatening content in low-resource languages [63,64]. Additionally, comprehensive surveys on threat detection techniques and moderation policies on tackling such content by online platforms have been conducted [65,66]. Many languages still lack sufficient linguistic resources for NLP-related tasks [67].…”
Section: Downstream Tasks In Hausa Languagementioning
confidence: 99%
“…Strategies and resources have also been put forward for identifying threatening content in low-resource languages [63,64]. Additionally, comprehensive surveys on threat detection techniques and moderation policies on tackling such content by online platforms have been conducted [65,66]. Many languages still lack sufficient linguistic resources for NLP-related tasks [67].…”
Section: Downstream Tasks In Hausa Languagementioning
confidence: 99%
“…7 Specifically, on May 23rd we queried Twitter for all the accounts that shared a tweet in UK-RU and FR-22, obtaining almost 2M users that were suspended by the platform for violating their rules. Twitter might suspend an account in a variety of circumstances that range from promoting violence and glorifying crime to hate speech, spam, and impersonation; similarly to other Big Tech platforms, these guidelines are considered among the most stringent [ 61 ]. More details about reasons for suspension are available in the Twitter documentation.…”
Section: Data Collectionmentioning
confidence: 99%
“…Research on this topic was motivated by the pressing need to create safer environments in social media platforms through strategies such as automatic content moderation (Weerasooriya et al 2023). With the goal of aiding content moderation, systems are trained to recognize a variety of related phenomena such as aggression, cyberbulling, hate speech, and toxicity (Arora et al 2023).…”
Section: Introductionmentioning
confidence: 99%