2019
DOI: 10.1109/access.2018.2886366
|View full text |Cite
|
Sign up to set email alerts
|

A Novel Hot Topic Detection Framework With Integration of Image and Short Text Information From Twitter

Abstract: Twitter exhibits several characteristics, including a limited number of features and noisy text information. Extracting valuable information from Twitter has made hot topic detection a challenging task. In this paper, a novel four-stage framework is proposed to improve the performance of topic detection. Data preprocessing is the first stage. Deep learning is then exploited to enrich short text information via image understanding. Next, improved latent Dirichlet allocation is used to optimize the image effecti… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
8
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
8
2

Relationship

1
9

Authors

Journals

citations
Cited by 20 publications
(8 citation statements)
references
References 23 publications
0
8
0
Order By: Relevance
“…Kireyev et al proposed a modified LDA method to identify topics from tweets [35]. Zhang et al proposed an approach to obtaining hot topics from Twitter [36]. Furthermore, Weng et al merged tweets into a single document and applied the original LDA method [37].…”
Section: 1mentioning
confidence: 99%
“…Kireyev et al proposed a modified LDA method to identify topics from tweets [35]. Zhang et al proposed an approach to obtaining hot topics from Twitter [36]. Furthermore, Weng et al merged tweets into a single document and applied the original LDA method [37].…”
Section: 1mentioning
confidence: 99%
“…Latent Dirichlet allocation (LDA) is an unsupervised machine learning technique based on a probabilistic graph model [29]. It can discover potential topics in large-scale document sets, and it works well in text mining, especially short text processing [30]. Supposing the set of documents is D, the set of topics is T, and the vocabulary in the document is denoted by w, then the relationship between the topic, document, and vocabulary of LDA can be expressed by the following formula [31]:…”
Section: Foundation Theory For Ldamentioning
confidence: 99%
“…In the traditional text field, a large number of scholars focus on topic detection and tracking based on news articles. A lot of outstanding contributions have been made and fruitful results have been achieved (e.g., [2], [7]- [16]). With the extensive application of pictures and videos, topic detection and tracking based on multimedia data has gradually become popular.…”
Section: B Tdtmentioning
confidence: 99%