2014
DOI: 10.2139/ssrn.3199099
|View full text |Cite
|
Sign up to set email alerts
|

Linked Knowledge Sources for Topic Classification of Microposts: A Semantic Graph-Based Approach

Abstract: Short text messages a.k.a Microposts (e.g. Tweets) have proven to be an effective channel for revealing information about trends and events, ranging from those related to Disaster (e.g. hurricane Sandy) to those related to Violence (e.g. Egyptian revolution). Being informed about such events as they occur could be extremely important to authorities and emergency professionals by allowing such parties to immediately respond.In this work we study the problem of topic classification (TC) of Microposts, which aims… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0
5

Year Published

2014
2014
2020
2020

Publication Types

Select...
4
1
1

Relationship

1
5

Authors

Journals

citations
Cited by 6 publications
(11 citation statements)
references
References 30 publications
0
6
0
5
Order By: Relevance
“…Applying topic modeling methods such as LDA and ToT to extract topics from tweets might suffer from the sparsity problem (Sriram et al, ; Cheng et al, ), because they are designed for regular documents and not short, noisy, and informal texts like tweets. As suggested in Varga et al (), to obtain better topics from Twitter without modifying the standard topic detection methods, we annotate each tweet boldmdouble-struckM with concepts defined in Wikipedia using an existing semantic annotator. We see each concept as a term in the set double-struckW.…”
Section: Methodsmentioning
confidence: 99%
“…Applying topic modeling methods such as LDA and ToT to extract topics from tweets might suffer from the sparsity problem (Sriram et al, ; Cheng et al, ), because they are designed for regular documents and not short, noisy, and informal texts like tweets. As suggested in Varga et al (), to obtain better topics from Twitter without modifying the standard topic detection methods, we annotate each tweet boldmdouble-struckM with concepts defined in Wikipedia using an existing semantic annotator. We see each concept as a term in the set double-struckW.…”
Section: Methodsmentioning
confidence: 99%
“…External information can be particularly useful in domains where the available documents are short and do not thus contain much information. To this end, Varga, Cano Basave, Rowe, Ciravegna, and He (2014) report significant improvement in performance when the content of tweets is linked to knowledge graphs as opposed to lexical-only content contained in the input tweets.…”
Section: Textual Datamentioning
confidence: 99%
“…In a study addressing the problem of classifying microposts, a novel meta-graph was presented to ensure a more fine-grained classification of concepts providing a set of novel semantic features. 117 In another study, Yang et al proposed a new and unsupervised topic model called Trend-Sensitive-Latent Dirichlet Allocation (TS-LDA) that can effectively extract latent topics from contents and tried to find interesting tweets based on topic identification by modeling temporal trends on Twitter over time. 118 LDA is a fundamental method of topic semantic modeling that discovers latent topics from the documents.…”
Section: Semantic Text Analysismentioning
confidence: 99%