2017
DOI: 10.1073/pnas.1700770114
|View full text |Cite
|
Sign up to set email alerts
|

Robust continuous clustering

Abstract: Clustering is a fundamental procedure in the analysis of scientific data. It is used ubiquitously across the sciences. Despite decades of research, existing clustering algorithms have limited effectiveness in high dimensions and often require tuning parameters for different domains and datasets. We present a clustering algorithm that achieves high accuracy across multiple domains and scales efficiently to high dimensions and large datasets. The presented algorithm optimizes a smooth continuous objective, which… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

2
106
0
1

Year Published

2017
2017
2023
2023

Publication Types

Select...
4
4

Relationship

0
8

Authors

Journals

citations
Cited by 157 publications
(115 citation statements)
references
References 39 publications
(36 reference statements)
2
106
0
1
Order By: Relevance
“…The V-Nases A. chroococcum [190] and A. vinelandii [191] were studied by Xray absorption near edge structure (XANES) in the EXAFS region and the data considered consistent with V II or V IV in a distorted octahedral coordination environment. On the other hand, from the synthesis of clusters modeling nitrogenases it was concluded that the oxidation state of vanadium would be V III [192], although the assignment of oxidation state in these type of clusters is difficult because of electron delocalization.…”
Section: Nitrogenasesmentioning
confidence: 99%
“…The V-Nases A. chroococcum [190] and A. vinelandii [191] were studied by Xray absorption near edge structure (XANES) in the EXAFS region and the data considered consistent with V II or V IV in a distorted octahedral coordination environment. On the other hand, from the synthesis of clusters modeling nitrogenases it was concluded that the oxidation state of vanadium would be V III [192], although the assignment of oxidation state in these type of clusters is difficult because of electron delocalization.…”
Section: Nitrogenasesmentioning
confidence: 99%
“…Accordingly, scientists have been studying clustering algorithms for more than a half century, with a large number of excellent algorithms having been put forward and widely used. Wellknown algorithms can be divided into different categories, such as the partition methods (2,3), the densitybased algorithms (4,5), the affinity propagation algorithms (6,7), the feature transformation methods (8)(9)(10), and so on. The clustering results of these algorithms are often challenged by the poor readability, caused by the lack of observable representative data point for each cluster, the exogenously determined number of clusters together with the fixed granularity, and the unclear organization of resulted clusters.…”
Section: Introductionmentioning
confidence: 99%
“…The improvised variants of the kNN algorithm work in two stages, the first stage is used to learn and identify an optimal-k parameter value for each corresponding test data sample and subsequently the second stage uses this value on the traditional kNN classification method in order to predict the test tuple samples by using the optimal-k parameter value thus learned [15]. But this approach suffers from the disadvantage of being slow as the two stage process of first learning an optimal-k parameter value for each test sample and the subsequent process of scanning again all the training data samples for the purpose of finding the nearest data tuple neighbors of each given test tuple is time-consuming [16].…”
Section: A Knn Classifiermentioning
confidence: 99%