Proceedings of the 22nd International Conference on Machine Learning - ICML '05 2005
DOI: 10.1145/1102351.1102481
|View full text |Cite
|
Sign up to set email alerts
|

A new Mallows distance based metric for comparing clusterings

Abstract: Despite of the large number of algorithms developed for clustering, the study on comparing clustering results is limited. In this paper, we propose a measure for comparing clustering results to tackle two issues insufficiently addressed or even overlooked by existing methods: (a) taking into account the distance between cluster representatives when assessing the similarity of clustering results; (b) constructing a unified framework for defining a distance based on either hard or soft clustering and ensuring th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
49
0
1

Year Published

2006
2006
2022
2022

Publication Types

Select...
5
2
2

Relationship

1
8

Authors

Journals

citations
Cited by 50 publications
(50 citation statements)
references
References 13 publications
(17 reference statements)
0
49
0
1
Order By: Relevance
“…Given two users' community membership distributions, we use the Categorical Clustering Distance(CCD) [25] to compare the quality of these two distributions. This method relates only to users' community membership distributions.…”
Section: Evaluation Metricmentioning
confidence: 99%
“…Given two users' community membership distributions, we use the Categorical Clustering Distance(CCD) [25] to compare the quality of these two distributions. This method relates only to users' community membership distributions.…”
Section: Evaluation Metricmentioning
confidence: 99%
“…For partitionings pðS 1 Þ; pðS 2 Þ find the optimal clusters correspondence, such that the sum of distances between matched clusters is minimal. Then the contribution of a single data vector x i 2 S c to the overall difference between cluster structures is given by [26] …”
Section: Clustering Stabilitymentioning
confidence: 99%
“…We use the Categorical Clustering Distance(CCD) [7] to compare the similarity between the computed community distribution and the ideal community distribution. For PA-PER, we treat each conference as a community and the proportion of the number of papers one author published in each conference as the ideal probability the author belongs to that community.…”
Section: Community Membership Evaluationmentioning
confidence: 99%