2013
DOI: 10.1016/j.jpdc.2012.09.009
|View full text |Cite
|
Sign up to set email alerts
|

Fault tolerant decentralised -Means clustering for asynchronous large-scale networks

Abstract: The K-Means algorithm for cluster analysis is one of the most influential and popular data mining methods. Its straightforward parallel formulation is well suited for distributed memory systems with reliable interconnection networks, such as massively parallel processors and clusters of workstations. However, in large-scale geographically distributed systems the straightforward parallel algorithm can be rendered useless by a single communication failure or high latency in communication paths. The lack of scala… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
25
0
7

Year Published

2014
2014
2020
2020

Publication Types

Select...
5
3
2

Relationship

4
6

Authors

Journals

citations
Cited by 59 publications
(32 citation statements)
references
References 33 publications
(76 reference statements)
0
25
0
7
Order By: Relevance
“…Fatta et al designed a fault tolerant epidemic clustering algorithm which does not require global communication [7]. However, their research is focused on handling network failures and does not provide the same SSE cost as the standard k-means clustering algorithm.…”
Section: Related Workmentioning
confidence: 99%
“…Fatta et al designed a fault tolerant epidemic clustering algorithm which does not require global communication [7]. However, their research is focused on handling network failures and does not provide the same SSE cost as the standard k-means clustering algorithm.…”
Section: Related Workmentioning
confidence: 99%
“…Correctly operating processes reach consensus when all of them detect the failed ones. In [14], an Epidemicbased aggregation protocol is used to perform a global synchronisation and reduction operation for a fully decentralised K-Means clustering without global communication patterns under node churn and message loss. The work in [15] investigated heuristic methods to detect the convergence of Epidemic aggregation and those methods could be used to build a consensus protocol for data aggregation.…”
Section: Related Workmentioning
confidence: 99%
“…Eyal et al [31] provide a generic algorithm for clustering in a static network. Fatta et al [32] propose a gossip-based distributed k-means clustering, which is initiated with similar initial centroids, and proceeds towards centroid convergence with rounds of gossiping. Shen et al [33] propose a distributed clustering in a static network, incorporating information theory measures.…”
Section: Related Workmentioning
confidence: 99%