2022
DOI: 10.1002/sam.11573
|View full text |Cite
|
Sign up to set email alerts
|

A general iterative clustering algorithm

Abstract: The quality of a cluster analysis of unlabeled units depends on the quality of the between units dissimilarity measures. Data-dependent dissimilarity is more objective than data independent geometric measures such as Euclidean distance. As suggested by Breiman, many data driven approaches are based on decision tree ensembles, such as a random forest (RF), that produce a proximity matrix that can easily be transformed into a dissimilarity matrix. An RF can be obtained using labels that distinguish units with re… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5

Relationship

1
4

Authors

Journals

citations
Cited by 5 publications
(1 citation statement)
references
References 32 publications
0
1
0
Order By: Relevance
“…In this section, the properties of the proximity matrix produced by TARF for use in a cluster analysis is appraised in eight real‐world studies. Dalleau et al [10] conducted empirical evaluations of ET in these datasets as did Lin et al [21] in their evaluation of the performance of a newgeneralized iterative clustering algorithm (GICA) for obtaining a proximity matrix. We use all eight of the studies for appraising TARF for clustering and four of them for classification.…”
Section: Evaluation Of Tarf In Real‐world Datasetsmentioning
confidence: 99%
“…In this section, the properties of the proximity matrix produced by TARF for use in a cluster analysis is appraised in eight real‐world studies. Dalleau et al [10] conducted empirical evaluations of ET in these datasets as did Lin et al [21] in their evaluation of the performance of a newgeneralized iterative clustering algorithm (GICA) for obtaining a proximity matrix. We use all eight of the studies for appraising TARF for clustering and four of them for classification.…”
Section: Evaluation Of Tarf In Real‐world Datasetsmentioning
confidence: 99%