2014
DOI: 10.1007/s00453-014-9909-1
|View full text |Cite
|
Sign up to set email alerts
|

A Spectral Algorithm for Latent Dirichlet Allocation

Abstract: The problem of topic modeling can be seen as a generalization of the clustering problem, in that it posits that observations are generated due to multiple latent factors (e.g., the words in each document are generated as a mixture of several active topics, as opposed to just one). This increased representational power comes at the cost of a more challenging unsupervised learning problem of estimating the topic probability vectors (the distributions over words for each topic), when only the words are observed a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

3
223
0

Year Published

2014
2014
2019
2019

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 117 publications
(226 citation statements)
references
References 47 publications
3
223
0
Order By: Relevance
“…The first two are proved in Anandkumar et al [6], and the third can be proved by a similar proof of Theorem 4.1 in Anandkumar et al [7]. We omit the details here.…”
Section: Algorithm 3: Tensor Orthogonal Decomposition (Tod)mentioning
confidence: 71%
See 4 more Smart Citations
“…The first two are proved in Anandkumar et al [6], and the third can be proved by a similar proof of Theorem 4.1 in Anandkumar et al [7]. We omit the details here.…”
Section: Algorithm 3: Tensor Orthogonal Decomposition (Tod)mentioning
confidence: 71%
“…This gives a higher value to a phrase with scores of (3,3,3) than to one with scores of (1,3,5), though the average score is identical. Finally, IdealScore K is calculated using the scores of the top K phrases out of all judged phrases.…”
Section: User Study and Quantitative Resultsmentioning
confidence: 99%
See 3 more Smart Citations