Proceedings of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval 2016
DOI: 10.1145/2911451.2914731
Examining the Coherence of the Top Ranked Tweet Topics

Abstract: Topic modelling approaches help scholars to examine the topics discussed in a corpus. Due to the popularity of Twitter, two distinct methods have been proposed to accommodate the brevity of tweets: the tweet pooling method and Twitter LDA. Both methods produce more interpretable topics than standard Latent Dirichlet Allocation (LDA) when applied to tweets. However, while various metrics have been proposed to estimate the coherence of the generated topics from …

Cited by 17 publications (14 citation statements); references 12 publications.
“…In general, the coherence values increased along with the number of topics as aforementioned. The result is in line with that in the data mining literature [49]. Conversely, the mean value of spatial autocorrelation decreased.…”
Section: Inferring Activity Types and Entangled Urban Functions (supporting)
confidence: 91%
“…In general, the higher the coherence value, the better the quality of the topics. We implemented a grid search of best topic models and ended up with the same conclusion as that in [49]. Increasing the number of topics (T) leads to higher coherence values.…”
Section: Indicators for Selecting the Number of Topics (mentioning)
confidence: 99%
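The grid search described in the statement above can be sketched as follows. This is a minimal illustration, not the cited authors' code: the function name, candidate values, and the toy coherence scores are all hypothetical, and in practice `coherence_for_k` would train and evaluate an LDA model for each K.

```python
def best_num_topics(candidate_ks, coherence_for_k):
    """Pick the topic count K whose model achieves the highest coherence.

    candidate_ks: iterable of candidate numbers of topics.
    coherence_for_k: callable mapping a K to that model's coherence score.
    """
    return max(candidate_ks, key=coherence_for_k)

# Toy stand-in scores, mirroring the reported trend that coherence
# grows with the number of topics (illustrative values only):
toy_scores = {10: 0.31, 20: 0.38, 40: 0.44, 70: 0.52}
print(best_num_topics(toy_scores, toy_scores.get))  # 70
```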
“…We show the coherence of the topic models extracted from the two candidate communities in Appendix Figure B1. The coherence results are consistent with Fang, MacDonald, Ounis, and Habel (2016a): the average coherence of a topic model decreases when the number of topics increases; however, the increasing line of c@10/20/30 in Figure B1 indicates that the top-ranked topics in a topic model are much easier to understand as K increases. Among proClinton topic models, we found the coherence (c@10/20/30) of topics becomes stable when K reaches 70, and for proTrump, when K reaches 60.…”
Section: Discussion (supporting)
confidence: 74%
“…Meanwhile, we also examine the top 2/7 most coherent topics in a model for more effective coherence evaluation, i.e. the C@2 & C@7 metrics, following [24].…”
Section: Methods (mentioning)
confidence: 99%
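The C@n metrics referenced in these citation statements average the coherence of only the n most coherent topics, rather than all topics in the model. A minimal sketch, assuming per-topic coherence scores have already been computed (the function name and example scores below are illustrative, not taken from the cited papers):

```python
def c_at_n(topic_coherences, n):
    """Average coherence of the n most coherent topics in a model.

    topic_coherences: per-topic coherence scores (higher = more coherent).
    n: how many of the top-ranked topics to average over.
    """
    top = sorted(topic_coherences, reverse=True)[:n]
    return sum(top) / len(top)

# Illustrative per-topic scores for a 5-topic model (not real data):
scores = [0.42, 0.35, 0.61, 0.28, 0.55]
print(c_at_n(scores, 2))  # averages the two highest scores, 0.61 and 0.55
```

Ranking topics before averaging is what lets C@n reward a model whose best few topics are easy to interpret, even when coherence averaged over all topics declines as the topic count grows.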