2010
DOI: 10.5121/ijaia.2010.1404
|View full text |Cite
|
Sign up to set email alerts
|

Performance analysis of k-means with different initialization methods for high dimensional data

Abstract: Developing effective clustering method for high dimensional dataset is a challenging problem due to the curse of dimensionality. Among all the partition based clustering algorithms, k-means is one of the most well known methods to partition a dataset into groups of patterns. However, the k-means method converges to one of many local minima. And it is known that, the final result depends on the initial starting points (means). Many methods have been proposed to improve the performance of k-means algorithm. In t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
10
0
2

Year Published

2013
2013
2023
2023

Publication Types

Select...
7
3

Relationship

0
10

Authors

Journals

citations
Cited by 23 publications
(12 citation statements)
references
References 6 publications
0
10
0
2
Order By: Relevance
“…The first principal component represents maximum variability in the data and succeeding components describe the remaining variability [39]. PCA and the linear transformation are used for dimensionality and noise reduction and initial centeroid computation for k-means clustering algorithm [36] and [40]. The heuristics approach is used to reduce the number of distance calculation to assign the data point to the cluster.…”
Section: Principal Component Analysis (Pca)mentioning
confidence: 99%
“…The first principal component represents maximum variability in the data and succeeding components describe the remaining variability [39]. PCA and the linear transformation are used for dimensionality and noise reduction and initial centeroid computation for k-means clustering algorithm [36] and [40]. The heuristics approach is used to reduce the number of distance calculation to assign the data point to the cluster.…”
Section: Principal Component Analysis (Pca)mentioning
confidence: 99%
“…The author in literature [3] uses Principal Component Analysis for dimension reduction and to find initial cluster centers.…”
Section: Related Workmentioning
confidence: 99%
“…Data is normally done through a pre-process data cleansing, data integration, selection and transformation of data and prepared for mining. Data mining can also be done on different types of databases and data storage, but the type of pattern is found determined by different types of functionality mining data such as descriptions, association, correlation analysis, classification, prediction, analysis of clusters, and so on (Tajunisha, 2010).…”
Section: Introductionmentioning
confidence: 99%