2021
DOI: 10.1088/1757-899x/1085/1/012008
|View full text |Cite
|
Sign up to set email alerts
|

Performance analysis of Word Embeddings for Cyberbullying Detection

Abstract: Cyber bullying activities are increasing day by day with the increase of Social Media Platforms such as Face book, Twitter, Instagram etc. Bullies take the advantage of these large online connected platforms due to which it became as a big challenging task in Natural Language Processing (NLP). In this paper, we compare the performance of various word embedding methods from basic word embedding methods to recent advanced language models such as RoBERTa, XLNET, ALBERT, etc. for cyberbullying detection. We used L… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
9
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 15 publications
(10 citation statements)
references
References 6 publications
0
9
0
Order By: Relevance
“…There was consistent outperformance by SBERT when the embeddings were used as the input across the classifiers, which was expected since it worked on the semantic textual similarity (STS) benchmark. The performance of static word embedding (i.e., Word2Vec, GloVe, fast-Text) was not as optimal as the contextual embeddings from the language models (i.e, RoBerta, XLNet, Albert) when coupled with classifiers for cyberbullying detection [87].…”
Section: F Features Used In Automated Cyberbullying Detectionmentioning
confidence: 98%
“…There was consistent outperformance by SBERT when the embeddings were used as the input across the classifiers, which was expected since it worked on the semantic textual similarity (STS) benchmark. The performance of static word embedding (i.e., Word2Vec, GloVe, fast-Text) was not as optimal as the contextual embeddings from the language models (i.e, RoBerta, XLNet, Albert) when coupled with classifiers for cyberbullying detection [87].…”
Section: F Features Used In Automated Cyberbullying Detectionmentioning
confidence: 98%
“…An important factor in classifying texts, according to the machine learning models is to digitize them [23]. To train any machine learning classifiers, the input data needs to be in numerical format [24]. By applying various feature extraction techniques, every text information needs to be converted into a numerical representation.…”
Section: Feature Extractionmentioning
confidence: 99%
“…Feature extraction is one of the crucial parts in machine learning modelling because the performance of classifier depends on features used during the classification process [63]. The current study chose four types of features in the category of content-based features only for a comparison of the model performance evaluation before we proceed and integrate another feature category (e.g.…”
Section: Feature Extractionmentioning
confidence: 99%
“…In NLP tasks, machine learning algorithms depend on word embedding. Thus, we also made word embedding one of the features to convert the text for each message in the data set to numeric form [63]. We used pre-trained Word2vec…”
Section: Word Embeddingmentioning
confidence: 99%