2020
DOI: 10.11591/ijeecs.v18.i3.pp1494-1500
|View full text |Cite
|
Sign up to set email alerts
|

Cyberbullying identification in twitter using support vector machine and information gain based feature selection

Abstract: <span>Cyberbullying is one of the actions that violate the ITE Law where the crime is committed on social media applications such as Twitter. This action is difficult to detect if no one is reporting the tweet. Cyberbullying tweet identification aims to classify tweets that contain bullying. Classification is done using Support Vector Machine method where this method aims to find the dividing hyperplane between negative and positive class. This study is a text classification where more data is used, the … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
16
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 20 publications
(16 citation statements)
references
References 16 publications
0
16
0
Order By: Relevance
“…Machine learning (ML) based approaches with different feature selection methods are widely used in cyberbullying tweet classification. Purnamasari et al [26] utilized the SVM and Information Gain(IG) based feature selection method for detecting cyberbullying events in tweets. Muneer & Fati [11] used various classifiers, namely AdaBoost(ADB), Light Gradient Boosting Machine (LGBM), SVM, RF, Stochastic Gradient Descent (SGD), Logistic Regression (LR), and MNB, and for cyberbullying events identification in tweets.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Machine learning (ML) based approaches with different feature selection methods are widely used in cyberbullying tweet classification. Purnamasari et al [26] utilized the SVM and Information Gain(IG) based feature selection method for detecting cyberbullying events in tweets. Muneer & Fati [11] used various classifiers, namely AdaBoost(ADB), Light Gradient Boosting Machine (LGBM), SVM, RF, Stochastic Gradient Descent (SGD), Logistic Regression (LR), and MNB, and for cyberbullying events identification in tweets.…”
Section: Related Workmentioning
confidence: 99%
“…The input dataset and the data annotation are described in sections III-A and III-B. Two baseline cyberbullying models based on deep learning, namely Bi-LSTM [21], RNN [21], and three baseline cyberbullying models based on machine learning models, namely, SVM [26], Multinomial Naive Bayes (MNB) [11], and RF [11] are used for the comparison with the proposed DEA-RNN model. These models have been selected from state-of-the-art cyberbullying detection in social media.…”
Section: Ivexperimental Analysismentioning
confidence: 99%
“…It is evident that in many cases [25] similar to this, random forest is a good model. Though SVM has been proven to work better in some special cases, such as fault classification in smart distribution network [26], ozone prediction [27], cyberbullying identification [28], harmonic source identification [29], etc. In Table 2, we can see that CNN did good.…”
Section: Fig 3 Comparative Visualization Of Accelerometer and Gyroscope Data Points In Different Activitiesmentioning
confidence: 99%
“…The input variables that maximize the information gain are selected which in turn minimizes the entropy and best splits the dataset into groups for efficient classification. Information gain is very effectively used in various researches for Twitter sentiment classification also [29]- [30] gain is biased for the input feature with higher number of distinct values. The (1) gives the formula for IG calculation as given under [31]:…”
Section: Information Gainmentioning
confidence: 99%