2022
DOI: 10.1007/s10618-022-00869-6
|View full text |Cite
|
Sign up to set email alerts
|

Large scale K-means clustering using GPUs

Abstract: The k-means algorithm is widely used for clustering, compressing, and summarizing vector data. We present a fast and memory-efficient GPU-based algorithm for exact k-means, Asynchronous Selective Batched K-means (ASB K-means). Unlike most GPU-based k-means algorithms that require loading the whole dataset onto the GPU for clustering, the amount of GPU memory required to run our algorithm can be chosen to be much smaller than the size of the whole dataset. Thus, our algorithm can cluster datasets whose size exc… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(2 citation statements)
references
References 33 publications
(33 reference statements)
0
2
0
Order By: Relevance
“…In addition, K-means clustering method is widely used in large-scale data processing. Mi et al proposed a large-scale K-means clustering method based on GPUs in "Large scale K-means clustering using GPUs" [5], which provides an efficient computational means for the analysis of arrhythmia data. And M. AI et al in "K-means clustering algorithms: a comprehensive review, variants analysis, and advances in the era of big data" [6] provided a comprehensive evaluation and analysis of K-Means clustering algorithms that provides researchers with a more comprehensive selection of methods.…”
Section: Introductionmentioning
confidence: 99%
“…In addition, K-means clustering method is widely used in large-scale data processing. Mi et al proposed a large-scale K-means clustering method based on GPUs in "Large scale K-means clustering using GPUs" [5], which provides an efficient computational means for the analysis of arrhythmia data. And M. AI et al in "K-means clustering algorithms: a comprehensive review, variants analysis, and advances in the era of big data" [6] provided a comprehensive evaluation and analysis of K-Means clustering algorithms that provides researchers with a more comprehensive selection of methods.…”
Section: Introductionmentioning
confidence: 99%
“…Cluster Analysis: K-means [12][13][14] was used, a well-known non-parametric technique for cluster analysis that has numerous fields of application such as healthcare, coronavirus, and urban hotspots, e.g., [15][16][17]. In k-means, each instance is assigned to its nearest centroid, which represents an average of the instances in the cluster.…”
mentioning
confidence: 99%