2006
DOI: 10.1109/tpami.2006.66
|View full text |Cite
|
Sign up to set email alerts
|

A genetic algorithm using hyper-quadtrees for low-dimensional k-means clustering

Abstract: The k-means algorithm is widely used for clustering because of its computational efficiency. Given n points in d-dimensional space and the number of desired clusters k, k-means seeks a set of k cluster centers so as to minimize the sum of the squared Euclidean distance between each point and its nearest cluster center. However, the algorithm is very sensitive to the initial selection of centers and is likely to converge to partitions that are significantly inferior to the global optimum. We present a genetic a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
28
0

Year Published

2006
2006
2017
2017

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 98 publications
(28 citation statements)
references
References 14 publications
0
28
0
Order By: Relevance
“…The algorithms appear to be scalable for larger values of k due to increasing sparsity of discs in the auxiliary problems. It is worthwhile to mention that some of the state-of-art heuristics proposed in [11,24,36,37,47,59] did not report the optimal solutions found here for the Reinelt's drilling data set with n = 1060 entities and k = 120, 150. To the best of our knowledge, this is the first time that such solutions are reported in the literature.…”
Section: Results In the Planementioning
confidence: 91%
“…The algorithms appear to be scalable for larger values of k due to increasing sparsity of discs in the auxiliary problems. It is worthwhile to mention that some of the state-of-art heuristics proposed in [11,24,36,37,47,59] did not report the optimal solutions found here for the Reinelt's drilling data set with n = 1060 entities and k = 120, 150. To the best of our knowledge, this is the first time that such solutions are reported in the literature.…”
Section: Results In the Planementioning
confidence: 91%
“…It analyses the relationship between the ependent or response variable and independent or predictor variables. The relationship is expressed in the form of an equation that predicts the response variable as a linear function of predictor variable [7][8][9][10]. Linear Regression: Y=a+bX+u.…”
Section: Regressionmentioning
confidence: 99%
“…This includes simulated annealing [21], evolutionary algorithms [22], [24], [18], tabu search [11], and ant colony optimization [6]. Also, hybrid approaches that combine multiple algorithms have been proposed in literature [24], [22].…”
Section: Related Workmentioning
confidence: 99%
“…The data are taken from German Town Data, which is a twodimensional data set with 59 observations, obtained from [35]. The SSE value for KMeans clustering for five clusters is the reported minimum value in literature [24].…”
Section: Definition Of Strategymentioning
confidence: 99%