Partitional Clustering Algorithms 2014
DOI: 10.1007/978-3-319-09259-1_11
|View full text |Cite
|
Sign up to set email alerts
|

Hubness-Based Clustering of High-Dimensional Data

Abstract: Hubness has recently been established as a significant property of k-nearest neighbor (k-NN) graphs obtained from high-dimensional data using a distance measure, with traits and effects relevant to the cluster structure of data, as well as clustering algorithms. The hubness property is manifested with increasing (intrinsic) data dimensionality. The distribution of data point in-degrees, i.e. the number of times points appear among the k nearest neighbors of other points in the data, becomes highly skewed. This… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2014
2014
2020
2020

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 13 publications
(6 citation statements)
references
References 69 publications
0
6
0
Order By: Relevance
“…Additionally, since it was demonstrated on several occasions that a better handling of hub points may result in better overall clustering quality in manydimensional problems [40,83,84], we intend to consider either extending the existing clustering quality indexes or proposing new ones that would incorporate this finding into account.…”
Section: Perspectives and Future Directionsmentioning
confidence: 97%
See 1 more Smart Citation
“…Additionally, since it was demonstrated on several occasions that a better handling of hub points may result in better overall clustering quality in manydimensional problems [40,83,84], we intend to consider either extending the existing clustering quality indexes or proposing new ones that would incorporate this finding into account.…”
Section: Perspectives and Future Directionsmentioning
confidence: 97%
“…Hubness-based clustering has recently been proposed for high-dimensional clustering problems [83,84] and has been successfully applied in some domains like document clustering [40].…”
Section: Clustering Techniques For High-dimensional Datamentioning
confidence: 99%
“…Unfortunately, some of the hubs are bad in the sort of sense that they may mislead machine learning algorithms. The presence of hubs have been studied primarily in context of classification, clustering and instance selection, see (Radovanović et al, 2010a), (Tomašev and Mladenić, 2013), (Radovanović et al, 2009), (Radovanović et al, 2010b), (Tomašev et al, 2011), (Tomašev et al, 2015b), , and (Tomašev et al, 2015a) for a survey.…”
Section: Related Workmentioning
confidence: 99%
“…A feature selection technique explores the possibility of dimensional subset for carrying out clustering by eliminating unnecessary and inappropriate dimensions. Subspace clustering is one of such technique that positions its search operation and generates information about the clusters present in multiple subspaces in overlapping conditions [4] [5].…”
Section: Introductionmentioning
confidence: 99%