2008
DOI: 10.18637/jss.v025.i04
|View full text |Cite
|
Sign up to set email alerts
|

clValid: AnRPackage for Cluster Validation

Abstract: The R package clValid contains functions for validating the results of a clustering analysis. There are three main types of cluster validation measures available, "internal", "stability", and "biological". The user can choose from nine clustering algorithms in existing R packages, including hierarchical, K-means, self-organizing maps (SOM), and model-based clustering. In addition, we provide a function to perform the self-organizing tree algorithm (SOTA) method of clustering. Any combination of validation meas… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

2
503
0
21

Year Published

2009
2009
2024
2024

Publication Types

Select...
6
3

Relationship

0
9

Authors

Journals

citations
Cited by 653 publications
(570 citation statements)
references
References 31 publications
2
503
0
21
Order By: Relevance
“…This study uses four cluster validation measures to estimate the number of clusters in the data. These measures are implemented in the clValid and fpc packages in R software (Brock et al, 2008;Hennig, 2010;R Development Core Team, 2012).…”
Section: Resultsmentioning
confidence: 99%
“…This study uses four cluster validation measures to estimate the number of clusters in the data. These measures are implemented in the clValid and fpc packages in R software (Brock et al, 2008;Hennig, 2010;R Development Core Team, 2012).…”
Section: Resultsmentioning
confidence: 99%
“…The Dynamic Tree Cut (dynamicTreeCut R-package) (8) algorithm was used to define clusters. The clusters were validated using the internal and the stability measures designed in clValid R-package (9). The nonrandomness of patient distribution among clusters was tested by Fisher's Exact Test with simulated P-values (10 7 replicates).…”
Section: B-cell Profile Data Analysis and Hierarchical Clusteringmentioning
confidence: 99%
“…We allowed only groups of five or more patients to be considered as a cluster. The hierarchical clustering algorithm on this dataset was validated by clValid R-package (9). This approach uses all information contained in all six parameters from 1,024 bins to create a similarity matrix.…”
Section: Clusters Of Patientsmentioning
confidence: 99%
“…(Brock et al 2008) was used to select the optimal number of clusters. Almost 60% of variance among cadastres was explained with 5 clusters using PAM method.…”
Section: Methodsmentioning
confidence: 99%