2022
DOI: 10.7717/peerj-cs.1059
|View full text |Cite
|
Sign up to set email alerts
|

Investigating toxicity changes of cross-community redditors from 2 billion posts and comments

Abstract: This research investigates changes in online behavior of users who publish in multiple communities on Reddit by measuring their toxicity at two levels. With the aid of crowdsourcing, we built a labeled dataset of 10,083 Reddit comments, then used the dataset to train and fine-tune a Bidirectional Encoder Representations from Transformers (BERT) neural network model. The model predicted the toxicity levels of 87,376,912 posts from 577,835 users and 2,205,581,786 comments from 890,913 users on Reddit over 16 yea… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(6 citation statements)
references
References 52 publications
0
2
0
Order By: Relevance
“…As many as six articles focus on sentiment analysis for different purposes ( Smetanin, 2022 ; Pratama & Firmansyah, 2022 ; Baxi, Philip & Mago, 2022 ; Nguyen & Gokhale, 2022 ; Shamoi et al., 2022 ; Ali, Irfan & Lashari, 2023 ). Four studies focused on tackling online harms of different kinds, with studies on abusive language detection ( Almerekhi, Kwak & Jansen, 2022 ; Ramponi et al., 2022 ), suicidal ideation detection ( Baghdadi et al., 2022 ) and misinformation detection ( Obeidat et al., 2022 ). Others studied NLP techniques for social media , focused on the analysis of Twitter discourse ( Heaton et al., 2023 ), language identification ( Hidayatullah et al., 2023 ) and named entity recognition ( Fudholi et al., 2023 ).…”
Section: Special Issue Themesmentioning
confidence: 99%
See 1 more Smart Citation
“…As many as six articles focus on sentiment analysis for different purposes ( Smetanin, 2022 ; Pratama & Firmansyah, 2022 ; Baxi, Philip & Mago, 2022 ; Nguyen & Gokhale, 2022 ; Shamoi et al., 2022 ; Ali, Irfan & Lashari, 2023 ). Four studies focused on tackling online harms of different kinds, with studies on abusive language detection ( Almerekhi, Kwak & Jansen, 2022 ; Ramponi et al., 2022 ), suicidal ideation detection ( Baghdadi et al., 2022 ) and misinformation detection ( Obeidat et al., 2022 ). Others studied NLP techniques for social media , focused on the analysis of Twitter discourse ( Heaton et al., 2023 ), language identification ( Hidayatullah et al., 2023 ) and named entity recognition ( Fudholi et al., 2023 ).…”
Section: Special Issue Themesmentioning
confidence: 99%
“… Almerekhi, Kwak & Jansen (2022) investigated changes in online behaviour of users who publish in multiple communities on Reddit by measuring their toxicity levels. They first automatically labelled a large collection of over 87 million posts as toxic or non-toxic, which they then analysed.…”
Section: Summary Of Contributionsmentioning
confidence: 99%
“…[10] developed a framework that supports airlines in addressing customer complaints and improving services during global events like the COVID-19 pandemic through social media sentiment analysis focusing on sarcasm detection. [11] analyzed 2 billion posts and comments from Reddit to identify toxic comments. The authors hope to bring more awareness to the online harassment problem experienced by many people nowadays and potentially prevent toxic behavior on social networks.…”
Section: Related Work a Opinion Mining In Social Media 1) Overviewmentioning
confidence: 99%
“…Granger causality has also been accepted in the areas of business, management, accounting, and economics [ 33 , 34 , 35 , 36 , 37 , 38 , 39 , 40 , 41 ]. The contribution of Granger causality has also been noticed in computer science [ 42 , 43 , 44 , 45 , 46 ] and engineering [ 47 , 48 , 49 , 50 , 51 ].…”
Section: Introductionmentioning
confidence: 99%