2014
DOI: 10.1007/s11227-014-1225-7
|View full text |Cite
|
Sign up to set email alerts
|

Optimized big data K-means clustering using MapReduce

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
52
0
1

Year Published

2015
2015
2021
2021

Publication Types

Select...
6
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 128 publications
(53 citation statements)
references
References 14 publications
0
52
0
1
Order By: Relevance
“…The iteration dependence was eliminated, and high performance was obtained by using the processing model. Extensive experiments demonstrate that the proposed methods were efficient, robust, and scalable .…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…The iteration dependence was eliminated, and high performance was obtained by using the processing model. Extensive experiments demonstrate that the proposed methods were efficient, robust, and scalable .…”
Section: Related Workmentioning
confidence: 99%
“…MapReduce [1][2][3][4][5][6] is a programming model aimed for parallel processing of large volumes of data by dividing the work into a set of independent tasks. In the MapReduce framework [7][8][9], a distributed file system (DFS) performs initial partitioning of data in multiple machines and represents data as pairs.…”
Section: Introductionmentioning
confidence: 99%
“…At present, there are a lot of improvements in the selection of initial clustering centers for K-means algorithm [5][6][7]. (1) The method of maximum and minimum distance is used to select the clustering center point then calculate the K value and find a reasonable center point.…”
Section: Related Workmentioning
confidence: 99%
“…Third, from the contents of documents, arbitrary semantic structures can be extracted. As the number of documents without annotations (i.e., unstructured texts) is growing exponentially, it is preferred to take unsupervised methods [5,23,31]. Topic modeling is one of such methods, and it captures the latent semantic structures across the documents.…”
Section: Introductionmentioning
confidence: 99%