2017
DOI: 10.1007/s11227-017-2182-8
|View full text |Cite
|
Sign up to set email alerts
|

A hybrid MapReduce-based k-means clustering using genetic algorithm for distributed datasets

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
25
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 36 publications
(25 citation statements)
references
References 23 publications
0
25
0
Order By: Relevance
“…On the other hand, [11] used a Genetic Algorithm (GA) with Mahalanobis distance along with the K-means clustering algorithm as an influential combination to propose a two-phase clustering algorithm for distributed datasets. In first phase, GA is utilized in parallel on fragments, which were assigned to different sites.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…On the other hand, [11] used a Genetic Algorithm (GA) with Mahalanobis distance along with the K-means clustering algorithm as an influential combination to propose a two-phase clustering algorithm for distributed datasets. In first phase, GA is utilized in parallel on fragments, which were assigned to different sites.…”
Section: Related Workmentioning
confidence: 99%
“…While Equation (11) computed the attribute access matrix of sites (AAMS), AAMS was used to yield the total access cost matrix for all sites (TACS) with the help of Equation (11). In Equations (12) and (13), the final allocation of fragments over the cluster of sites was decided when second and third scenarios of allocation were being addressed.…”
Section: Cost Functionsmentioning
confidence: 99%
See 1 more Smart Citation
“…Yuan [23] proposed an improved K-means parallel algorithm has also achieved good results. Ankita [24] combines genetic algorithm with k-means algorithm and proposed a novel clustering algorithm for distributed datasets. The above work proves that the algorithm based on MapReduce can well avoid the limitation of data size, and makes the mining of hyper-scale product review data possible.…”
Section: Related Workmentioning
confidence: 99%
“…In order to further verify the feasibility of the PR-HD algorithm, we choose a total of three MapReduce-based algorithms from reference [22], [23] and [24] as the comparison algorithm. Reference [22] improved VSM model, and designed a parallel fuzzy c-means algorithm for hot microblogging topics discovery(HTD-PFCM).…”
Section: ) Accuracy Analysismentioning
confidence: 99%