2006
DOI: 10.1007/11731139_22
|View full text |Cite
|
Sign up to set email alerts
|

Parallel Density-Based Clustering of Complex Objects

Abstract: Abstract. In many scientific, engineering or multimedia applications, complex distance functions are used to measure similarity accurately. Furthermore, there often exist simpler lower-bounding distance functions, which can be computed much more efficiently. In this paper, we will show how these simple distance functions can be used to parallelize the density-based clustering algorithm DBSCAN. First, the data is partitioned based on an enumeration calculated by the hierarchical clustering algorithm OPTICS, so … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
22
0

Year Published

2012
2012
2018
2018

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 46 publications
(22 citation statements)
references
References 8 publications
0
22
0
Order By: Relevance
“…The experiment shows the algorithm achieves near linear speedup. Brecheisen et al [10] present a parallel DBSCAN on a workstation, which is parallelized by a conservative approximation of complex distance functions, based on the concept of filter merge points. The final result is derived from a global cluster connectivity graph.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…The experiment shows the algorithm achieves near linear speedup. Brecheisen et al [10] present a parallel DBSCAN on a workstation, which is parallelized by a conservative approximation of complex distance functions, based on the concept of filter merge points. The final result is derived from a global cluster connectivity graph.…”
Section: Related Workmentioning
confidence: 99%
“…= Unclassified then (7) if . = Unclassified then (8) Create c-cluster ← ( ); (9) if expandcluster ( , , , , , ) then (10) Create c-cluster ← ( ++); (11) end if (12) end if (13) the cells according to the received points and given in each of the mappers, so the cell with the same in different mappers stands for the same area; thus we can only use the cell to locate the assigned range in overall data space. (2) The points in an inclusive cell must belong to a c-cluster, so the cell and c-cluster are enough to stand for classification of all points in the cell.…”
Section: Cludoop Frameworkmentioning
confidence: 99%
“…The algorithm starts with an arbitrary point x ∈ X and retrieves its eps-neighborhood (Line 4). If the epsneighborhood contains at least minpts points, the procedure yields a new cluster, C. The algorithm then retrieves all points in X, which are density reachable from x and adds them to the cluster C (Line [8][9][10][11][12][13][14][15][16][17]. If the eps-neighborhood of x has less than minpts, then x is marked as noise (Line 6).…”
Section: Condition 23 (Noise)mentioning
confidence: 99%
“…The speedups are plotted in Figure 5 Finally, we have compared our parallel DBSCAN algorithm with the previous master-slave approaches [14], [15], [16], [17], [18], [19], [20]. As their source codes are not available, we have implemented their ideas, where the master process perform the cluster assignment while the slave processes answer the neighborhood queries [15], [17].…”
Section: B Parallel Dbscan On a Shared Memory Computermentioning
confidence: 99%
See 1 more Smart Citation