2021
DOI: 10.3390/s22010155
|View full text |Cite
|
Sign up to set email alerts
|

Comparison of Machine Learning and Sentiment Analysis in Detection of Suspicious Online Reviewers on Different Type of Data

Abstract: The article focuses on solving an important problem of detecting suspicious reviewers in online discussions on social networks. We have concentrated on a special type of suspicious authors, on trolls. We have used methods of machine learning for generation of detection models to discriminate a troll reviewer from a common reviewer, but also methods of sentiment analysis to recognize the sentiment typical for troll’s comments. The sentiment analysis can be provided also using machine learning or lexicon-based a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
9
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7
2

Relationship

1
8

Authors

Journals

citations
Cited by 20 publications
(15 citation statements)
references
References 25 publications
(29 reference statements)
1
9
0
Order By: Relevance
“…For collecting datasets for this research, we used publicly available troll online reviewer dataset developed and created by Machova et al [ 46 ]. This dataset have been collected from Reddit platform, and it concerned with online political discussion.…”
Section: Methodsmentioning
confidence: 99%
“…For collecting datasets for this research, we used publicly available troll online reviewer dataset developed and created by Machova et al [ 46 ]. This dataset have been collected from Reddit platform, and it concerned with online political discussion.…”
Section: Methodsmentioning
confidence: 99%
“…Based on our previous works [ 9 , 13 ] we focused on the most successful methods of deep learning, namely convolutional as well as recurrent networks.…”
Section: Methodsmentioning
confidence: 99%
“…There are two different approaches to detecting disinformation. The first is using machine learning methods to train models for the identification of authors of disinformation [ 9 ] or to focus on toxicity in texts of conversational content, such as in the article [ 10 ] which analyzes hate speech using a web interface with focus on the most popular social networks such as Twitter, YouTube, and Facebook. Then, it is important to decide whether to use strong methods of deep learning or to use ensemble learning, which can work effectively even with weak classifiers.…”
Section: Introductionmentioning
confidence: 99%
“…The C4.5 algorithm is used in our work. The main limitation of decision trees is that they are prone to overfitting by creating overcomplicated models that view the feature of the training set as all data characteristics [ 43 ]. The random forest can avoid this problem.…”
Section: Other Da Techniques and Machine Learning Algorithmsmentioning
confidence: 99%