2021
DOI: 10.1016/j.is.2021.101740
|View full text |Cite
|
Sign up to set email alerts
|

A real-time deep-learning approach for filtering Arabic low-quality content and accounts on Twitter

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
9
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 21 publications
(10 citation statements)
references
References 16 publications
0
9
0
Order By: Relevance
“…LSTM with the Word2Vec model achieves an F1-score of 98.03% for word segmentation in the Arabic language (Almuhareb et al 2019 ). Neural network-based word embedding efficiently models a word and its context and has become one of the most widely used methods of word distribution representation (N.H. Phat and Anh 2020 )(Alharthi et al 2021 ).…”
Section: Review On Text Analytics Word Embedding Application and Deep...mentioning
confidence: 99%
See 1 more Smart Citation
“…LSTM with the Word2Vec model achieves an F1-score of 98.03% for word segmentation in the Arabic language (Almuhareb et al 2019 ). Neural network-based word embedding efficiently models a word and its context and has become one of the most widely used methods of word distribution representation (N.H. Phat and Anh 2020 )(Alharthi et al 2021 ).…”
Section: Review On Text Analytics Word Embedding Application and Deep...mentioning
confidence: 99%
“… Craja et al ( 2020 ) Annual report analysis for fraud detection EDGAR database LR, RF, SVM, XGB, ANN, HAN Word2Vec HAN achieves an accuracy of 84.57% 2. Alharthi et al ( 2021 ) Arabic text low-quality content classification Twitter dataset CNN, LSTM Word2Vec, AraVec LSTM achieves an accuracy of 98% 3. Kozlowski et al ( 2020 ) French social media tweet analysis for crisis management French dataset SVM, CNN fastText, BERT, French FlauBert FlauBert achieves a micro F1-score of 85.4% 4.…”
Section: Appendix Amentioning
confidence: 99%
“…CNN works by shaping input data into a two-dimensional matrix, like time series or image pixels. Another standard layer in DL is the long short-term memory (LSTM), which learns long-term dependencies in sequential data such as text and time series [16].…”
Section: A Deep Learningmentioning
confidence: 99%
“…The study's findings showed that using CNN-based deep learning models is suitable for various large datasets and binary classification of a specific task. Alharthi et al [16] presented a real-time model to identify low-quality tweets and accounts that make these contents on Twitter. They extracted the Arabic dataset from Twitter using Twitter API and then applied both CNN and LSTM techniques.…”
Section: B Arabic Text Classificationmentioning
confidence: 99%
“…Authors used various classi er algorithms as RF, Decision Trees (DT), Decision Table, Random Tree, KStar, Bayes Net and Simple Logistic, Content . They calculated classification, Accuracy-94.5% FPR-4.1% FNR-6.6% for spam dataset[11].Alharthi,Alhothali and Moria collected 10,000 Arabic tweets dataset for prediction.Authors used machine learning algorithms with Long Short Term Memory and word embedding feature representation.The system classification accuracy depends on tweet length and evaluated values 0.97,0.98,0.95 for Accuracy, Precision and Recall respectively[12].Liu, Pang and Wang used 97,839 Restaurant with 31,317 Hotel review dataset for classi cation. They used Machine Learning techniques, Bi-LSTM and multimodal neural network model to analyze the effective features to improve the performance, Recall-0.80 Precision-0.82 F1-score-0.81[13].Saidani, Adi and Allili organized dataset from, Enron Corpus consisting of 2,893 messages with 2,412…”
mentioning
confidence: 99%