2011
DOI: 10.1214/11-aos887
|View full text |Cite
|
Sign up to set email alerts
|

Spectral clustering and the high-dimensional stochastic blockmodel

Abstract: Networks or graphs can easily represent a diverse set of data sources that are characterized by interacting units or actors. Social networks, representing people who communicate with each other, are one example. Communities or clusters of highly connected actors form an essential feature in the structure of several empirical networks. Spectral clustering is a popular and computationally feasible method to discover these communities. The stochastic blockmodel [Social Networks 5 (1983) 109--137] is a social netw… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

18
872
1

Year Published

2012
2012
2019
2019

Publication Types

Select...
6
2

Relationship

1
7

Authors

Journals

citations
Cited by 780 publications
(902 citation statements)
references
References 53 publications
18
872
1
Order By: Relevance
“…Moreover, RMT has been utilized to understand the analytic properties of compressed sensing (Cai and Jiang, 2011;Vershynin, 2012). Effective uses of concepts and tools from RMT have also been made in the analysis of spectral clustering methods with a growing number of clusters (Rohe et al, 2011), in community detection problems in large random networks (Nadakuditi and Newman, 2012), and in kernel PCA methods for dimension reduction when the observations are high-dimensional and the argument of the kernel function depends on the inner products of pairs of observations (El Karoui, 2010b,a).…”
Section: Other Applicationsmentioning
confidence: 99%
“…Moreover, RMT has been utilized to understand the analytic properties of compressed sensing (Cai and Jiang, 2011;Vershynin, 2012). Effective uses of concepts and tools from RMT have also been made in the analysis of spectral clustering methods with a growing number of clusters (Rohe et al, 2011), in community detection problems in large random networks (Nadakuditi and Newman, 2012), and in kernel PCA methods for dimension reduction when the observations are high-dimensional and the argument of the kernel function depends on the inner products of pairs of observations (El Karoui, 2010b,a).…”
Section: Other Applicationsmentioning
confidence: 99%
“…In particular, for a graph with n nodes, previous theoretical analyses for spectral clustering, under the SBM and its extensions, [23], [7], [25], [10] assumed that the minimum degree of the graph scales at least by a polynomial power of log n. Even when this assumption is satisfied, the dependence on the minimum degree is highly restrictive when it comes to making inferences about cluster recovery. Our analysis provides cluster recovery results that potentially do not depend on the above mentioned constraint on the minimum degree.…”
Section: Introductionmentioning
confidence: 99%
“…Denoting τ as the regularization parameter, previous theoretical analyses of regularization ( [7], [23]) provided high-probability bounds on this spectral norm. These bounds have a 1/ √ τ dependence on τ , for large τ .…”
Section: Introductionmentioning
confidence: 99%
“…This is related to tight clustering condition [13] and less stringent than earlier results which assume that within-and-between-cluster similarities are constant and bounded in expectation [14]. The condition enforces that within-and-between-cluster similarities concentrate away from each other.…”
Section: Hierarchical Intuitionistic Fuzzy Possibilistic Cmeans Kernementioning
confidence: 92%
“…The condition enforces that within-and-between-cluster similarities concentrate away from each other. This condition is satisfied if similarities are constant in expectation, perturbed with any subgaussian noise [14], [15].…”
Section: Hierarchical Intuitionistic Fuzzy Possibilistic Cmeans Kernementioning
confidence: 99%