2018
DOI: 10.1007/978-981-10-8633-5_3
|View full text |Cite
|
Sign up to set email alerts
|

Classification of Short Text Using Various Preprocessing Techniques: An Empirical Evaluation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 15 publications
(2 citation statements)
references
References 18 publications
0
2
0
Order By: Relevance
“…For some specific applications, such as detecting spamming accounts [16], even more structured user features, such as the URL rate and the interaction rate, are believed to be highly informative. Interestingly, a recent study [14] has reversed the prediction logic and based the analysis on replies, but this approach struggled to predict the popularity of the original source tweet.…”
Section: High or Low Number Of Repliesmentioning
confidence: 99%
“…For some specific applications, such as detecting spamming accounts [16], even more structured user features, such as the URL rate and the interaction rate, are believed to be highly informative. Interestingly, a recent study [14] has reversed the prediction logic and based the analysis on replies, but this approach struggled to predict the popularity of the original source tweet.…”
Section: High or Low Number Of Repliesmentioning
confidence: 99%
“…The raw comment data obtained from RTV SLO contained a lot of unnecessary information, such as text formatting tags (for italics and bold text), hyperlinks, and metadata from other cited comments. As leaving the text formatting tags in the comments may dilute the information in the comments and therefore potentially worsen the resulting models, we decided to remove them from the texts [25,26].…”
Section: Data Preprocessingmentioning
confidence: 99%