2018
DOI: 10.3390/su10082614
|View full text |Cite|
|
Sign up to set email alerts
|

An Efficient Grid-Based K-Prototypes Algorithm for Sustainable Decision-Making on Spatial Objects

Abstract: Data mining plays a critical role in sustainable decision-making. Although the k-prototypes algorithm is one of the best-known algorithms for clustering both numeric and categorical data, clustering a large number of spatial objects with mixed numeric and categorical attributes is still inefficient due to complexity. In this paper, we propose an efficient grid-based k-prototypes algorithm, GK-prototypes, which achieves high performance for clustering spatial objects. The first proposed algorithm utilizes both … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
3
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 10 publications
(7 citation statements)
references
References 43 publications
0
3
0
Order By: Relevance
“…The weight age Hamming dissimilarity metric introduced is used, while this dissimilarity metric considers both the relative frequency and the distribution of each mode category. Jang et al [27] present a grid-based k-prototypes algorithm, namely GK-prototypes, that enhances the basic algorithm's performance. As far as categorical attributes are concerned, the algorithm takes into account the maximum distance between a cluster center and a cell, while as far as numeric attributes are concerned, the algorithm takes into account the maximum and minimum distances.…”
Section: Related Workmentioning
confidence: 99%
“…The weight age Hamming dissimilarity metric introduced is used, while this dissimilarity metric considers both the relative frequency and the distribution of each mode category. Jang et al [27] present a grid-based k-prototypes algorithm, namely GK-prototypes, that enhances the basic algorithm's performance. As far as categorical attributes are concerned, the algorithm takes into account the maximum distance between a cluster center and a cell, while as far as numeric attributes are concerned, the algorithm takes into account the maximum and minimum distances.…”
Section: Related Workmentioning
confidence: 99%
“…Kacem et al [26] propose parallelization of the K-prototypes clustering method [9] to handle large mixed datasets, this algorithm uses the MapReduce framework [108] for parallelization. Jang et al [27] use a gridbased indexing technique to develop grid-based K-prototypes algorithm that speeds up K-prototypes algorithm. The experiments carried out using a spatial dataset consisting of numeric and categorical features show that the proposed method takes less time than the original K-prototypes algorithm.…”
Section: A Partitional Clusteringmentioning
confidence: 99%
“…Kacem et al [26] propose parallelization of the K-prototypes clustering method [9] to handle large mixed datasets, this algorithm uses the MapReduce framework [108] for parallelization. Jang et al [27] use a grid-based indexing technique to develop grid-based Kprototypes algorithm that speeds up K-prototypes algorithm. The experiments carried out using a spatial dataset consisting of numeric and categorical features show that the proposed method takes less time than the original K-prototypes algorithm.…”
Section: A Partitional Clusteringmentioning
confidence: 99%