2017
DOI: 10.6026/97320630013101
|View full text |Cite
|
Sign up to set email alerts
|

optCluster: An R Package for Determining the Optimal Clustering Algorithm

Abstract: There exist numerous programs and packages that perform validation for a given clustering solution; however, clustering algorithms fare differently as judged by different validation measures. If more than one performance measure is used to evaluate multiple clustering partitions, an optimal result is often difficult to determine by visual inspection alone. This paper introduces optCluster, an R package that uses a single function to simultaneously compare numerous clustering partitions (created by different al… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
32
0
1

Year Published

2017
2017
2023
2023

Publication Types

Select...
8
1
1

Relationship

0
10

Authors

Journals

citations
Cited by 43 publications
(37 citation statements)
references
References 7 publications
0
32
0
1
Order By: Relevance
“…The R package optCluster (R ver. 3.4.3) was then used to determine the best clustering algorithm among agnes, clara, diana, hierarchical, kmeans, pam, and sota clustering methods using the validation methods connectivity, Dunn index and silhouette width, and the number of clusters set to ten 83 . Clusters that showed similar expression profiles were then merged.…”
Section: Methodsmentioning
confidence: 99%
“…The R package optCluster (R ver. 3.4.3) was then used to determine the best clustering algorithm among agnes, clara, diana, hierarchical, kmeans, pam, and sota clustering methods using the validation methods connectivity, Dunn index and silhouette width, and the number of clusters set to ten 83 . Clusters that showed similar expression profiles were then merged.…”
Section: Methodsmentioning
confidence: 99%
“…We employed the R package optCluster (R version 3.4.3) to determine the optimal clustering method and the optimal number of clusters [16]. We implemented the five clustering methods (agglomerative hierarchical, hierarchical divisive, K-means, K-medoids and model-based clustering) with the number of clusters ranging between 4-10 and evaluated the clustering results using the Dunn index, silhouette width and adjusted connectivity as validating metrics.…”
Section: Cluster Validationmentioning
confidence: 99%
“…This method has been proven as an efficient way to indirectly evaluate the phenotypes of TME (Wang et al, 2018). By using optCluster (Sekula et al, 2017) to evaluate the internal and stability indexes of the seven clustering algorithms (clara, diana, hierarchical, kmeans, model, pam, and sota), the optimal number and the algorithm of clustering were determined. Finally, the Clara algorithm and three groups were selected as the most robust clustering parameters.…”
Section: Unsupervised Clustering Algorithm To Determine Tme Subtypes mentioning
confidence: 99%