2016
DOI: 10.1016/j.is.2016.02.007
|View full text |Cite
|
Sign up to set email alerts
|

Single-pass and linear-time k-means clustering based on MapReduce

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
24
0
1

Year Published

2016
2016
2021
2021

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 58 publications
(25 citation statements)
references
References 25 publications
0
24
0
1
Order By: Relevance
“…To deal with large scale data, several clustering methods which are based on parallel frameworks have been designed in the literature (Bahmani et al 2012;Hadian and Shahrivari 2014;Kim et al 2014;Ludwig 2015;Shahrivari and Jalili 2016;Zhao et al 2009). Most of these methods use the MapReduce framework.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…To deal with large scale data, several clustering methods which are based on parallel frameworks have been designed in the literature (Bahmani et al 2012;Hadian and Shahrivari 2014;Kim et al 2014;Ludwig 2015;Shahrivari and Jalili 2016;Zhao et al 2009). Most of these methods use the MapReduce framework.…”
Section: Related Workmentioning
confidence: 99%
“…Bahmani et al have proposed a scalable k-means (Bahmani et al 2012) that extends k-means++ technique for initial seeding. Shahrivari and Jalili (2016) have proposed a single-pass and linear time MapReduce-based k-means method. Kim et al (2014) have proposed parallelizing densitybased clustering with MapReduce.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…K-Means algorithm similar with K-modes algorithm first select k samples as the centroid, use Europe distance as a similarity measure, for each sample, the remaining calculation to each centroid distance, and put it into the nearest centroid, finally re calculate the centroid [6,7]. Iteration until the centroid is no longer changed.…”
Section: Clustering Based Recommendation Algorithmmentioning
confidence: 99%
“…Therefore, it is important to study the graph mining algorithm based on disk, or a graph mining algorithm based on some parallel processing model, such as DNA model [19], MapReduce [20], etc..…”
Section: The Challenge Of Graph Data Miningmentioning
confidence: 99%