Mark Belford scite author profile

Mark Belford

3 Publications

88 Citation Statements Received

80 Citation Statements Given


Affiliations

University College Dublin

Publications


Stability of topic modeling via matrix factorization

Belford, Namee, Greene

2018

Expert Systems with Applications


Topic models can provide us with an insight into the underlying latent structure of a large corpus of documents. A range of methods have been proposed in the literature, including probabilistic topic models and techniques based on matrix factorization. However, in both cases, standard implementations rely on stochastic elements in their initialization phase, which can potentially lead to different results being generated on the same corpus when using the same parameter values. This corresponds to the concept of "instability" which has previously been studied in the context of k-means clustering. In many applications of topic modeling, this problem of instability is not considered and topic models are treated as being definitive, even though the results may change considerably if the initialization process is altered. In this paper we demonstrate the inherent instability of popular topic modeling approaches, using a number of new measures to assess stability. To address this issue in the context of matrix factorization for topic modeling, we propose the use of ensemble learning strategies. Based on experiments performed on annotated text corpora, we show that a K-Fold ensemble strategy, combining both ensembles and structured initialization, can significantly reduce instability, while simultaneously yielding more accurate topic models.
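The instability described in the abstract can be illustrated with a small sketch. This is not the authors' code or their stability measures: it assumes a basic NMF with multiplicative updates, a synthetic document-term matrix, and a simple Jaccard-based agreement score between the top-term sets of two runs. All function names here (`nmf`, `top_terms`, `run_agreement`, `mean_pairwise_agreement`) are hypothetical and chosen for illustration only.

```python
import numpy as np

def nmf(X, k, seed, n_iter=200, eps=1e-9):
    """Basic NMF (multiplicative updates) with random initialization.
    Illustrative only; not the implementation used in the paper."""
    rng = np.random.default_rng(seed)
    n_docs, n_terms = X.shape
    W = rng.random((n_docs, k)) + eps   # document-topic weights
    H = rng.random((k, n_terms)) + eps  # topic-term weights
    for _ in range(n_iter):
        H *= (W.T @ X) / (W.T @ W @ H + eps)
        W *= (X @ H.T) / (W @ H @ H.T + eps)
    return W, H

def top_terms(H, t):
    """Set of the t highest-weighted term indices for each topic."""
    return [frozenset(int(i) for i in np.argsort(row)[::-1][:t]) for row in H]

def jaccard(a, b):
    return len(a & b) / len(a | b)

def run_agreement(topics_a, topics_b):
    """Average, over topics in run A, of the best Jaccard match in run B.
    An assumed, simplified score, not one of the paper's measures."""
    return float(np.mean([max(jaccard(a, b) for b in topics_b)
                          for a in topics_a]))

def mean_pairwise_agreement(X, k, seeds, t=5):
    """Mean agreement over all pairs of runs that differ only in seed."""
    runs = [top_terms(nmf(X, k, s)[1], t) for s in seeds]
    scores = [run_agreement(runs[i], runs[j])
              for i in range(len(runs)) for j in range(i + 1, len(runs))]
    return float(np.mean(scores))

# Synthetic stand-in for a document-term matrix (30 documents, 20 terms).
X = np.random.default_rng(42).poisson(1.0, size=(30, 20)).astype(float)

# Same data, same k, different random initializations.
score = mean_pairwise_agreement(X, k=3, seeds=[0, 1, 2, 3])
print(f"mean pairwise top-term agreement: {score:.3f}")
```

A score of 1.0 would mean every run produced the same top-term sets; lower values indicate that the topics depend on the initialization. Ensemble strategies such as those proposed in the paper aim to reduce this dependence.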


Ensemble topic modeling using weighted term co-associations

Belford, Greene

2020

Expert Systems with Applications


Stability of Topic Modeling via Matrix Factorization

Belford, Namee, Greene

2017

Preprint

