2004
DOI: 10.1007/978-3-540-30116-5_23
|View full text |Cite
|
Sign up to set email alerts
|

Scalable Density-Based Distributed Clustering

Abstract: Abstract. Clustering has become an increasingly important task in analysing huge amounts of data. Traditional applications require that all data has to be located at the site where it is scrutinized. Nowadays, large amounts of heterogeneous, complex data reside on different, independently working computers which are connected to each other via local or wide area networks. In this paper, we propose a scalable density-based distributed clustering algorithm which allows a user-defined trade-off between clustering… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
55
0
4

Year Published

2005
2005
2016
2016

Publication Types

Select...
4
3

Relationship

2
5

Authors

Journals

citations
Cited by 72 publications
(59 citation statements)
references
References 10 publications
0
55
0
4
Order By: Relevance
“…(1) mark all objects as unvisited; (2) do (3) randomly select an unvisited object p; (4) mark p as visited; (5 (6) create a new cluster C, and add p to C; (7) let N be the set of objects in the epsneighborhood of p; (8) for each point p' in N (9) if p' is unvisited (10) mark p' as visited; (11) if the eps-neighborhood of p' has at least MinPts points, add those points to N; (12) if p'…”
Section: Methodmentioning
confidence: 99%
See 1 more Smart Citation
“…(1) mark all objects as unvisited; (2) do (3) randomly select an unvisited object p; (4) mark p as visited; (5 (6) create a new cluster C, and add p to C; (7) let N be the set of objects in the epsneighborhood of p; (8) for each point p' in N (9) if p' is unvisited (10) mark p' as visited; (11) if the eps-neighborhood of p' has at least MinPts points, add those points to N; (12) if p'…”
Section: Methodmentioning
confidence: 99%
“…For instance in Fig.1,the points 1 , 2 , … … … … , 12 ,and q is not directly density-reachable by the point p. Centering at the point p, a circle defined by a given eps-neighborhood containing at least two other points The point 1 is directly densityreachable from p, and it is used as a center point to draw the next circle with the same eps-neighborhood. The same neighborhood-forming mechanism repeatedly applies to point p2 (which is directly density-reachable from p1) 3 , … … … … , 12 and q in sequence, as shown by the gray circles. However, in Web opinion clustering [30], it may create problems.…”
Section: Description Of the Problemmentioning
confidence: 99%
“…FoF is a special case of the DBSCAN algorithm [23] corresponding to a M inP ts parameter of zero; there exists a large body of work on distributed DBSCAN algorithms [11,13,14,24]. This prior work can be categorized into two groups.…”
Section: Background and Related Workmentioning
confidence: 99%
“…These algorithms build a distributed spatial index on a shared-nothing cluster and use this index when merging local clustering results [11,24]. The second category of approaches is to perform approximate clustering by using clustering on local models [13] or using samples to reduce the size of data or the number of spatial index lookups [14].…”
Section: Background and Related Workmentioning
confidence: 99%
See 1 more Smart Citation