2013
DOI: 10.1016/j.eswa.2013.07.002
|View full text |Cite
|
Sign up to set email alerts
|

Cluster center initialization algorithm for K-modes clustering

Abstract: a b s t r a c tPartitional clustering of categorical data is normally performed by using K-modes clustering algorithm, which works well for large datasets. Even though the design and implementation of K-modes algorithm is simple and efficient, it has the pitfall of randomly choosing the initial cluster centers for invoking every new execution that may lead to non-repeatable clustering results. This paper addresses the randomized center initialization problem of K-modes algorithm by proposing a cluster center i… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
79
0

Year Published

2016
2016
2019
2019

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 104 publications
(84 citation statements)
references
References 25 publications
1
79
0
Order By: Relevance
“…First, they start with an initial partition of the data, and second, the quality of this partition is improved by a local search algorithm during the search phase. The initial partition can be obtained based on many different principles [4,8], but a common strategy is to use distinct prototypes [9]. Most typically, the globalization of the whole algorithm is based on random initialization with several regenerations [10].…”
Section: General Prototype-based Clustering and Its Convergencementioning
confidence: 99%
See 1 more Smart Citation
“…First, they start with an initial partition of the data, and second, the quality of this partition is improved by a local search algorithm during the search phase. The initial partition can be obtained based on many different principles [4,8], but a common strategy is to use distinct prototypes [9]. Most typically, the globalization of the whole algorithm is based on random initialization with several regenerations [10].…”
Section: General Prototype-based Clustering and Its Convergencementioning
confidence: 99%
“…The initial prototypes should be separated from each other [4,8]. Lately, the K-means++ algorithm [9], where the random initialization is based on a density function favoring distinct prototypes, has become the most popular variant to initialize the K-means-type of an algorithm.…”
Section: Introductionmentioning
confidence: 99%
“…Due to sensitivity of the algorithm to the initial parameters, it is important to ensure K-modes clustering with good initial cluster centers [33]. However, there are still no generally accepted initialization methods for kmeans clustering.…”
Section: Related Workmentioning
confidence: 99%
“…This section will detail describe the scaling mechanism, namely the theoretical foundation of knowledge scaling [16][17][18][19].…”
Section: Theoretical Foundationmentioning
confidence: 99%