2019
DOI: 10.1007/s00357-019-09314-8
|View full text |Cite
|
Sign up to set email alerts
|

Clustering Large Datasets by Merging K-Means Solutions

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
9
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
8
2

Relationship

0
10

Authors

Journals

citations
Cited by 15 publications
(10 citation statements)
references
References 26 publications
0
9
0
Order By: Relevance
“…The K-means algorithm is one of the most popular hierarchical algorithms and uses the minimum sum of squares to assign observations to groups. Such groups of data points are called clusters [46,47]. Observations allocated to the closest cluster, and the distance between an observation and a cluster is calculated from the Euclidean distance between the observation and the cluster center.…”
Section: Methodsmentioning
confidence: 99%
“…The K-means algorithm is one of the most popular hierarchical algorithms and uses the minimum sum of squares to assign observations to groups. Such groups of data points are called clusters [46,47]. Observations allocated to the closest cluster, and the distance between an observation and a cluster is calculated from the Euclidean distance between the observation and the cluster center.…”
Section: Methodsmentioning
confidence: 99%
“…Unlike existing robust methods, our proposal identifies general-shaped clusters and tolerates data contamination in a computationally efficient manner. It builds upon existing works based on two-step clustering, where a preliminary model-based algorithm is followed by a hierarchical agglomeration phase [31,34]. It thus inherits their properties but, unlike existing hybrid methods that lack robustness (i.e., only a pre-processing step is proposed in [34]), it can also detect and discard arbitrary forms of contamination.…”
Section: Motivationmentioning
confidence: 99%
“…The authors in [ 42 ] introduced DEMP-k (Directly Estimated Misclassification Probabilities), which is a combination of the HoSC-K-means (Homoscedastic Spherical Components) and hierarchical linkage functions, thereby increasing the speed and performance of the algorithm. Their work proposed a framework for hierarchical merging based on pairwise overlap between components, this was further applied to the K-means algorithm.…”
Section: Related Work On Computer Vision and Image Clusteringmentioning
confidence: 99%