2018 International Seminar on Research of Information Technology and Intelligent Systems (ISRITI) 2018
DOI: 10.1109/isriti.2018.8864484
|View full text |Cite
|
Sign up to set email alerts
|

SEMAR: An Interface for Indonesian Hate Speech Detection Using Machine Learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
3
1

Relationship

0
7

Authors

Journals

citations
Cited by 11 publications
(4 citation statements)
references
References 9 publications
0
4
0
Order By: Relevance
“…For text classification using the machine-learning approach, researchers have used several models to classify whether a text contain hate speech and abusive language or not including Naive Bayes (NB) [5] , [44] , [1] , [20] , [40] , [4] , [24] , [25] , [38] , Support Vector Machine (SVM) [5] , [34] , [44] , [1] , [20] , [40] , [7] , [24] , [25] , [38] , [27] , Logistic Regression (LR) [5] , [39] , [44] , [40] , [7] , [27] , Decision Tree (DT) [44] , Random Forest Decision Tree (RFDT) [5] , [39] , [1] , [20] , [7] , [24] , [25] , [38] , [27] , k-Nearest Neighbor (kNN) [34] , [44] , Latent Semantic Analysis (LSA) [3] , Maximum Entropy [20] , [19] , and Artificial Neural Network (ANN) [49] . These machine-learning models are usually combined with several text features including word n-grams [5] , [39] , [1] , [40] , [7] , [49] , [4] , [24] , [25] , [38] , [27] , character n-grams [5] , [39] , [1] , [40] , ...…”
Section: Methodsmentioning
confidence: 99%
“…For text classification using the machine-learning approach, researchers have used several models to classify whether a text contain hate speech and abusive language or not including Naive Bayes (NB) [5] , [44] , [1] , [20] , [40] , [4] , [24] , [25] , [38] , Support Vector Machine (SVM) [5] , [34] , [44] , [1] , [20] , [40] , [7] , [24] , [25] , [38] , [27] , Logistic Regression (LR) [5] , [39] , [44] , [40] , [7] , [27] , Decision Tree (DT) [44] , Random Forest Decision Tree (RFDT) [5] , [39] , [1] , [20] , [7] , [24] , [25] , [38] , [27] , k-Nearest Neighbor (kNN) [34] , [44] , Latent Semantic Analysis (LSA) [3] , Maximum Entropy [20] , [19] , and Artificial Neural Network (ANN) [49] . These machine-learning models are usually combined with several text features including word n-grams [5] , [39] , [1] , [40] , [7] , [49] , [4] , [24] , [25] , [38] , [27] , character n-grams [5] , [39] , [1] , [40] , ...…”
Section: Methodsmentioning
confidence: 99%
“…Before classifying the data, it is necessary to carry out several preprocessing procedures. Case folding involves changing words in a text into uniform lowercase letters to facilitate further processing [18,19]. Stop Word Removal, stop word is a common word that often appears in a sentence but has no meaning [18].…”
Section: Preprocessingmentioning
confidence: 99%
“…Case folding involves changing words in a text into uniform lowercase letters to facilitate further processing [18,19]. Stop Word Removal, stop word is a common word that often appears in a sentence but has no meaning [18]. Removing stop words can increase the signal-to-noise ratio in unstructured text and thus increase the statistical significance of terms that may be important for a specific task [20].…”
Section: Preprocessingmentioning
confidence: 99%
“…We carefully reviewed each document to obtain the key information of each work. In this part, we focus on [11], [30], [17], [23], [12], [28], [27], [21], [31], [32], [33], [34], [35], [36], [37], [38], [39], [40]…”
Section: B What Has Been Done So Far In Indonesian Abusive Language D...mentioning
confidence: 99%