2016
DOI: 10.1186/s13321-016-0114-x
|View full text |Cite
|
Sign up to set email alerts
|

How frequently do clusters occur in hierarchical clustering analysis? A graph theoretical approach to studying ties in proximity

Abstract: BackgroundHierarchical cluster analysis (HCA) is a widely used classificatory technique in many areas of scientific knowledge. Applications usually yield a dendrogram from an HCA run over a given data set, using a grouping algorithm and a similarity measure. However, even when such parameters are fixed, ties in proximity (i.e. two equidistant clusters from a third one) may produce several different dendrograms, having different possible clustering patterns (different classifications). This situation is usually… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

2
11
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
9

Relationship

0
9

Authors

Journals

citations
Cited by 13 publications
(13 citation statements)
references
References 31 publications
2
11
0
Order By: Relevance
“…Additional ties may also appear at any step of the hierarchical clustering process, and it is even possible to have ties due to the limited resolution (number of decimal digits) used to store the proximities matrix. Therefore, we conclude that the clustering of microsatellite markers is prone to generate tied distances, similarly to what happens in other cases [ 16 ].…”
Section: Introductionsupporting
confidence: 62%
“…Additional ties may also appear at any step of the hierarchical clustering process, and it is even possible to have ties due to the limited resolution (number of decimal digits) used to store the proximities matrix. Therefore, we conclude that the clustering of microsatellite markers is prone to generate tied distances, similarly to what happens in other cases [ 16 ].…”
Section: Introductionsupporting
confidence: 62%
“…First, digital fingerprints of 3D ligand structures were generated using 3-point pharmacophore [ 16 ]. Next, hierarchical clustering with default parameters was performed to group similar compounds into different clusters using their fingerprints and a cluster ID was assigned to each compound [ 17 , 18 ].…”
Section: Methodsmentioning
confidence: 99%
“…To yield different clusters of the data set, dendrogram can be fragmented at different levels. Each fragmented level provides a visual summary of the cluster through a picture of the groups and their proximity with a dramatic reduction in dimensionality of the original data (Leal et al 2016). In this study, the Ward's method is used evaluate distance between clusters (Bhardwaj & Parmar 2020).…”
Section: Hierarchical Cluster Analysismentioning
confidence: 99%