2017
DOI: 10.1002/asi.23980
|View full text |Cite
|
Sign up to set email alerts
|

Improving interpretations of topic modeling in microblogs

Abstract: Topic models were proposed to detect the underlying semantic structure of large collections of text documents to facilitate the process of browsing and accessing documents with similar ideas and topics. Applying topic models to short text documents to extract meaningful topics is challenging. The problem becomes even more complicated when dealing with short and noisy micro‐posts in Twitter that are about one general topic. In such a case, the goal of applying topic models is to extract subtopics. This results … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
3
0
2

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 16 publications
(6 citation statements)
references
References 25 publications
0
3
0
2
Order By: Relevance
“…Topic Modeling: This method discloses the hidden semantic structure of tweets. The studies in this theme have proposed customized topic models for Twitter data or utilized already developed topic models such as incorporating Twitter-LDA, WordNet, and hashtags to enhance the quality of topic-discovery [81], developing a topic model based on high utility pattern mining to detect emerging topics [82], utilizing an already developed method, called hierarchical Dirichlet processes (HDP), to detect all posts related a given event [83], proposing a model based on Latent Dirichlet Allocation (LDA) to extract key phrases for social media events [84], and integrating the recurrent Chinese restaurant process and word co-occurrence analysis to propose a nonparametric topic model for short text documents such as tweets [85]. It is worth mentioning that the studies that utilized preexisting topic models used a qualitative approach for coding topics.…”
Section: The Second Category Represented Common Research Paperrelatedmentioning
confidence: 99%
“…Topic Modeling: This method discloses the hidden semantic structure of tweets. The studies in this theme have proposed customized topic models for Twitter data or utilized already developed topic models such as incorporating Twitter-LDA, WordNet, and hashtags to enhance the quality of topic-discovery [81], developing a topic model based on high utility pattern mining to detect emerging topics [82], utilizing an already developed method, called hierarchical Dirichlet processes (HDP), to detect all posts related a given event [83], proposing a model based on Latent Dirichlet Allocation (LDA) to extract key phrases for social media events [84], and integrating the recurrent Chinese restaurant process and word co-occurrence analysis to propose a nonparametric topic model for short text documents such as tweets [85]. It is worth mentioning that the studies that utilized preexisting topic models used a qualitative approach for coding topics.…”
Section: The Second Category Represented Common Research Paperrelatedmentioning
confidence: 99%
“…For the validity, we incorporate a two-stage process, in which first we include LDA, which provides topics and their associated words. These words are then mapped with the help of WordNet, a lexical database of English (Fellbaum, 2005 ), for a better representation of each topic, leading to a valid representation of topics and their associated words (Alkhodair et al, 2018 ). As for the reliability, we employ inter-coder reliability, in which one author has assigned each topic to its closely associated constructs, followed by other authors who mark the given relations.…”
Section: Study 1: Exploratory Analysismentioning
confidence: 99%
“…LDA algoritem je bil uspešno uporabljen na različnih področjih tudi pri analizi krajših besedilnih dokumentov kot so objave na spletnih forumih in socialnih (Alkhodair, Fung, Rahman in Hung, 2018;Rashad, Mohammed, El-Midany, Kandil in Ibrahim, 2007). Algoritem je implementiran v knjižnico lda programa R (Chang, 2015).…”
Section: Statistična Analizaunclassified