2018
DOI: 10.1016/j.patcog.2018.05.030
|View full text |Cite
|
Sign up to set email alerts
|

A fast clustering algorithm based on pruning unnecessary distance computations in DBSCAN for high-dimensional data

Abstract: Clustering is an important technique to deal with large scale data which are explosively created in internet. Most data are high-dimensional with a lot of noise, which brings great challenges to retrieval, classification and understanding. No current existing approach is "optimal" for large scale data. For example, DB-SCAN requires O(n 2 ) time, Fast-DBSCAN only works well in 2 dimensions, and ρ-Approximate DBSCAN runs in O(n) expected time which needs dimension D to be a relative small constant for the linear… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
52
0
1

Year Published

2019
2019
2020
2020

Publication Types

Select...
5
4

Relationship

1
8

Authors

Journals

citations
Cited by 135 publications
(63 citation statements)
references
References 54 publications
0
52
0
1
Order By: Relevance
“…where w k,j , a uniformly distributed random data in [0, 1], is the weight coefficient of v f k,j , w k,j ≥ 0, and ∑ 4 k=1 w k,j = 1 . To satisfy this constraint, the conversion operations of w k,j in Equation 24are expressed as Equation 23, thus the candidates in Equation 22are rewritten as Equation (24).…”
Section: The Combined Forecasting Results By Optimized Weightsmentioning
confidence: 99%
See 1 more Smart Citation
“…where w k,j , a uniformly distributed random data in [0, 1], is the weight coefficient of v f k,j , w k,j ≥ 0, and ∑ 4 k=1 w k,j = 1 . To satisfy this constraint, the conversion operations of w k,j in Equation 24are expressed as Equation 23, thus the candidates in Equation 22are rewritten as Equation (24).…”
Section: The Combined Forecasting Results By Optimized Weightsmentioning
confidence: 99%
“…DBSCAN, a density-based spatial data clustering method, is widely used in image processing [22] and high-dimensional data procession [23,24]. The aims of the DBSCAN method are to group the candidate points into core points, border points, or outliers.…”
Section: Density Based Spatial Clustering Of Applications With Noise mentioning
confidence: 99%
“…In the future, we will focus on how to combine the different features more effectively and select a better classification model for the combined feature. Moreover, we will improve the proposed method by data preparation techniques, such as clustering [51][52][53], normalization, and data cleaning.…”
Section: Discussionmentioning
confidence: 99%
“…The photons are checked until the signal photon clusters are completely expanded or there are no photons left to check. A photon is defined as a "noise photon" if it does not belong to any classified signal cluster [21,39,40]. The clustering result and fitness value are calculated using the values of eps and MinPts, and then optimized by an iterative process.…”
Section: Methodsmentioning
confidence: 99%