2018
DOI: 10.1007/s00180-018-0830-y
|View full text |Cite
|
Sign up to set email alerts
|

Bootstrapping estimates of stability for clusters, observations and model selection

Abstract: Clustering is a challenging problem in unsupervised learning. In lieu of a gold standard, stability has become a valuable surrogate to performance and robustness. In this work, we propose a non-parametric bootstrapping approach to estimating the stability of a clustering method, which also captures stability of the individual clusters and observations. This flexible framework enables different types of comparisons between clusterings and can be used in connection with two if possible bootstrap approaches for s… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

1
63
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
4
1

Relationship

2
3

Authors

Journals

citations
Cited by 32 publications
(64 citation statements)
references
References 15 publications
1
63
0
Order By: Relevance
“…Bootstrap procedure : Stability estimation utilizes bootstrapping to estimate the stability of clusterings [38]. However, in our setting, the nodes in a network are variables instead of subjects, therefore are typically fixed during the sampling process.…”
Section: Methodsmentioning
confidence: 99%
See 3 more Smart Citations
“…Bootstrap procedure : Stability estimation utilizes bootstrapping to estimate the stability of clusterings [38]. However, in our setting, the nodes in a network are variables instead of subjects, therefore are typically fixed during the sampling process.…”
Section: Methodsmentioning
confidence: 99%
“…In lieu of a gold standard, different forms of cluster stability have been used as a surrogate to assess performance. Stability estimates capture how stable the clusterings are over several different representations of the data, which are derived through subsetting, cross‐validation, data noising or re‐sampling, among others [4, 12, 14, 15, 20, 22, 26, 36–38].…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations
“…The work of Koepke and Clarke is an exception as the stability can be computed at the levels of individual points, clusters, or overall partitions. More recently, Yu et al also proposed to measure stability at the three levels using bootstrap resampling. Another exception is made by Hennig , who proposed a measure to evaluate the stability of individual clusters.…”
Section: Introductionmentioning
confidence: 99%