Cluster center initialization algorithm for K-modes clustering

Khan, Shehroz S.; Ahmad, Amir

doi:10.1016/j.eswa.2013.07.002

Cited by 104 publications

(84 citation statements)

References 25 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…First, they start with an initial partition of the data, and second, the quality of this partition is improved by a local search algorithm during the search phase. The initial partition can be obtained based on many different principles [4,8], but a common strategy is to use distinct prototypes [9]. Most typically, the globalization of the whole algorithm is based on random initialization with several regenerations [10].…”

Section: General Prototype-based Clustering and Its Convergencementioning

confidence: 99%

See 1 more Smart Citation

Comparison of Internal Clustering Validation Indices for Prototype-Based Clustering

2017

View full text Add to dashboard Cite

Abstract:Clustering is an unsupervised machine learning and pattern recognition method. In general, in addition to revealing hidden groups of similar observations and clusters, their number needs to be determined. Internal clustering validation indices estimate this number without any external information. The purpose of this article is to evaluate, empirically, characteristics of a representative set of internal clustering validation indices with many datasets. The prototype-based clustering framework includes multiple, classical and robust, statistical estimates of cluster location so that the overall setting of the paper is novel. General observations on the quality of validation indices and on the behavior of different variants of clustering algorithms will be given.

show abstract

Section: General Prototype-based Clustering and Its Convergencementioning

confidence: 99%

“…The initial prototypes should be separated from each other [4,8]. Lately, the K-means++ algorithm [9], where the random initialization is based on a density function favoring distinct prototypes, has become the most popular variant to initialize the K-means-type of an algorithm.…”

Section: Introductionmentioning

confidence: 99%

Comparison of Internal Clustering Validation Indices for Prototype-Based Clustering

2017

View full text Add to dashboard Cite

show abstract

“…Due to sensitivity of the algorithm to the initial parameters, it is important to ensure K-modes clustering with good initial cluster centers [33]. However, there are still no generally accepted initialization methods for kmeans clustering.…”

Section: Related Workmentioning

confidence: 99%

An Anomaly Detection Based on Optimization

Alguliyev¹,

Alıguliyev²,

İmamverdiyev³

et al. 2017

IJISA

View full text Add to dashboard Cite

Abstract-At present, an anomaly detection is one of the important problems in many fields. The rapid growth of data volumes requires the availability of a tool for data processing and analysis of a wide variety of data types. The methods for anomaly detection are designed to detect object's deviations from normal behavior. However, it is difficult to select one tool for all types of anomalies due to the increasing computational complexity and the nature of the data. In this paper, an improved optimization approach for a previously known number of clusters, where a weight is assigned to each data point, is proposed. The aim of this article is to show that weighting of each data point improves the clustering solution. The experimental results on three datasets show that the proposed algorithm detects anomalies more accurately. It was compared to the k-means algorithm. The quality of the clustering result was estimated using clustering evaluation metrics. This research shows that the proposed method works better than k-means on the Australia (credit card applications) dataset according to the Purity, Mirkin and F-measure metrics, and on the heart diseases dataset according to F-measure and variation of information metric.

show abstract

“…This section will detail describe the scaling mechanism, namely the theoretical foundation of knowledge scaling [16][17][18][19].…”

Section: Theoretical Foundationmentioning

confidence: 99%

Deep Data Anaylizing Application Based on Scale Space Theory in Big Data Environment

Hu¹,

Gao²,

Xu³

2016

International Conferences on Software Engineering and Knowledge Engineering

View full text Add to dashboard Cite

Abstract-This paper introduces the basic scientific idea of the multi-scale to the field of big data analyzes, proposes a multi-scale framework of data analyzes in big data environment, present the multi-scale algorithm framework of knowledge conversion theory and apply the algorithm framework to the multi dimension association rules analysis. The proposed multi-scale association rule analysis algorithm uses the benchmark data set of analyzing results and the influence weight of benchmark data sets for target scale data sets to derived the association rules behind object scale data set, realize knowledge across scales derived and provide the possibility for multi-scale decision.

show abstract

Cluster center initialization algorithm for K-modes clustering

Cited by 104 publications

References 25 publications

Comparison of Internal Clustering Validation Indices for Prototype-Based Clustering

Comparison of Internal Clustering Validation Indices for Prototype-Based Clustering

An Anomaly Detection Based on Optimization

Deep Data Anaylizing Application Based on Scale Space Theory in Big Data Environment

Contact Info

Product

Resources

About