Partitional Clustering Algorithms 2014
DOI: 10.1007/978-3-319-09259-1_3
|View full text |Cite
|
Sign up to set email alerts
|

Linear, Deterministic, and Order-Invariant Initialization Methods for the K-Means Clustering Algorithm

Abstract: Over the past five decades, k-means has become the clustering algorithm of choice in many application domains primarily due to its simplicity, time/space efficiency, and invariance to the ordering of the data points. Unfortunately, the algorithm's sensitivity to the initial selection of the cluster centers remains to be its most serious drawback. Numerous initialization methods have been proposed to address this drawback. Many of these methods, however, have time complexity superlinear in the number of data po… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
41
0

Year Published

2016
2016
2024
2024

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 42 publications
(48 citation statements)
references
References 96 publications
0
41
0
Order By: Relevance
“…Standard clustering evaluation approaches that were tailored for partitional clustering methods [18] are not universal and not well suited for other types of tasks like the evaluation of nested clustering structures, for which they need to be extended and adapted [24].…”
Section: Clustering Quality Indexes: Existing Surveysmentioning
confidence: 99%
“…Standard clustering evaluation approaches that were tailored for partitional clustering methods [18] are not universal and not well suited for other types of tasks like the evaluation of nested clustering structures, for which they need to be extended and adapted [24].…”
Section: Clustering Quality Indexes: Existing Surveysmentioning
confidence: 99%
“…Calculates probability value of the fitness value by dividing total fitness value by fitness value b) Calculate the cumulative c) Specify value randomly. Determine randomly a range [1][2][3][4][5][6][7][8][9][10][11][12][13][14][15][16][17][18][19][20] and select the parent that will be the candidate toolbar and then select "1 Column" from the selection palette. d) Compare between random values and cumulative values The selection process is finished to get a new chromosome value and next for crossover process.…”
Section: A Proposed Methodsmentioning
confidence: 99%
“…It is also supported in the study of Celebi et. al [8] where a random initialization of the centroid process will cause K-means to be trapped in a minimum local point conditions. The local minimum is a situation where the right centroid is found only when the initial partition approaches the final solution.…”
Section: Introductionmentioning
confidence: 99%
“…The accuracy of the Davies-Bouldin Index is used to calculate the accuracy [17]. The Davis-Bouldin method is a function of the total ratio of intra cluster dispersion to the distance between clusters.…”
Section: -3 Calculation Competency Functionmentioning
confidence: 99%