2011 Third International Conference on Advanced Computing 2011
DOI: 10.1109/icoac.2011.6165203
|View full text |Cite
|
Sign up to set email alerts
|

A survey on Short text analysis in Web

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
9
0

Year Published

2013
2013
2021
2021

Publication Types

Select...
4
4

Relationship

0
8

Authors

Journals

citations
Cited by 17 publications
(11 citation statements)
references
References 25 publications
0
9
0
Order By: Relevance
“…As social media posts are often short and may include mis-spelled words or irrelevant characters (such as emojis), social media text documents share an extremely low number of overlapping terms within a collection of posts. To address the sparsity problem, scholars have suggested alternative methods, such as LDA extension to author-topic model, and the dual LDA approach that relies on external knowledge bases like Wikipedia (Atefeh, and Khreich 2015;Nugroho et al 2020;Rafeeque and Sendhilkumar 2011).…”
Section: Latent Dirichlet Allocation (Lda)mentioning
confidence: 99%
“…As social media posts are often short and may include mis-spelled words or irrelevant characters (such as emojis), social media text documents share an extremely low number of overlapping terms within a collection of posts. To address the sparsity problem, scholars have suggested alternative methods, such as LDA extension to author-topic model, and the dual LDA approach that relies on external knowledge bases like Wikipedia (Atefeh, and Khreich 2015;Nugroho et al 2020;Rafeeque and Sendhilkumar 2011).…”
Section: Latent Dirichlet Allocation (Lda)mentioning
confidence: 99%
“…External knowledge can be taken from ontologies (wordNet, Wikipedia) [12,13] or from large scale datasets on which topic models techniques are applied (LDA, Latent Semantic Indexing [14]...). Song et al in [15] and Rafeeque et al in [16] summarize these different techniques and show some of their usages in short text classification. Several reduction methods exist: feature abstraction, feature selection and LDA.…”
Section: Related Workmentioning
confidence: 99%
“…Our proposal is based on the existence of a mechanism that is able to determine the topics discussed by the pieces of information that are exchanged in OSNs (i.e., the messages). Topics here can be categories predefined by the underlying OSN infrastructure [54]; usergenerated tags like Flickr categories [53]; or categories or tags extracted from images [66,80], videos [3], geolocation information [45], or text [57,74,12].…”
Section: Topic Analysismentioning
confidence: 99%
“…With regard to topic extraction from text, current research in the field of NLP has made advancements on analysing short messages present in OSN, microblogs, etc. For a review of these works see [57]. Specifically, there are some NLP proposals endowed with new textmining and analysis techniques that analyse short and informal messages with acceptable accuracy [74].…”
Section: Topic Analysismentioning
confidence: 99%