Proceedings of the 17th International Conference on World Wide Web 2008
DOI: 10.1145/1367497.1367512
|View full text |Cite
|
Sign up to set email alerts
|

Topic modeling with network regularization

Abstract: In this paper, we formally define the problem of topic modeling with network structure (TMN). We propose a novel solution to this problem, which regularizes a statistical topic model with a harmonic regularizer based on a graph structure in the data. The proposed method combines topic modeling and social network analysis, and leverages the power of both statistical topic models and discrete regularization. The output of this model can summarize well topics in text, map a topic onto the network, and discover to… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
184
0
3

Year Published

2010
2010
2017
2017

Publication Types

Select...
6
4

Relationship

1
9

Authors

Journals

citations
Cited by 320 publications
(188 citation statements)
references
References 26 publications
1
184
0
3
Order By: Relevance
“…A natural approach to network-based topic modeling is to add a network-based regularization constraint to traditional topic models such as NetPLSA [65]. The relational topic model (RTM) proposed in [23] tries to model the generation of documents and links sequentially.…”
Section: Clustering Text In Networkmentioning
confidence: 99%
“…A natural approach to network-based topic modeling is to add a network-based regularization constraint to traditional topic models such as NetPLSA [65]. The relational topic model (RTM) proposed in [23] tries to model the generation of documents and links sequentially.…”
Section: Clustering Text In Networkmentioning
confidence: 99%
“…For example, the Author-Conference-Topic model (ACT) [14] treats authors and venues as probability distributions over topics extracted by means of an unsupervised learning technique. Mei et al [15] propose a framework to model topics by regularizing a statistical topic model through a harmonic regularizer, which is based on a graph structure. Differently from these methods, we exploit an automatically generated knowledge base [3] to characterize research topics semantically and we use this as the basis for associating a diachronic semantic topic distribution with each author.…”
Section: State Of the Artmentioning
confidence: 99%
“…For the more challenging Cora dataset, Table 3 shows the results on Cora dataset. We compare the performance of RankCompete and with the state-of-the-arts algorithms including Normalized Cut(NC) [7], NetPLSA [11], RankClus [8] and iTopicModel [10]. The results show that our algorithm outperforms the state of the art algorithms in clustering information networks.…”
Section: Bibliography Networkmentioning
confidence: 99%