2006
DOI: 10.1007/11871637_15
|View full text |Cite
|
Sign up to set email alerts
|

Measuring Constraint-Set Utility for Partitional Clustering Algorithms

Abstract: Abstract.Clustering with constraints is an active area of machine learning and data mining research. Previous empirical work has convincingly shown that adding constraints to clustering improves performance, with respect to the true data labels. However, in most of these experiments, results are averaged over different randomly chosen constraint sets, thereby masking interesting properties of individual sets. We demonstrate that constraint sets vary significantly in how useful they are for constrained clusteri… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

6
118
0

Year Published

2009
2009
2022
2022

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 134 publications
(124 citation statements)
references
References 4 publications
6
118
0
Order By: Relevance
“…Although previous work report on average sub-stantial improvement in the clustering purity, (Davidson et al, 2006) shows that even if the constraints are generated from gold-standard data, some constraint sets can decrease clustering purity. The results vary significantly depending on the specific set of constraints used.…”
Section: Constrained Clusteringmentioning
confidence: 75%
“…Although previous work report on average sub-stantial improvement in the clustering purity, (Davidson et al, 2006) shows that even if the constraints are generated from gold-standard data, some constraint sets can decrease clustering purity. The results vary significantly depending on the specific set of constraints used.…”
Section: Constrained Clusteringmentioning
confidence: 75%
“…Cannot-link constraints (like charges) increase the weights of affected edge. The situation is illustrated in Figure 1, where must-link constraints are {e (2,3), e (6,7), e (14,15)} and the cannot-link constraints are {e(9,10)}.…”
Section: Magnetically Affected Paths (Map)mentioning
confidence: 99%
“…In some experiments, we detect phenomena where the accuracy of the algorithm goes up and down slightly as we increase the number of constraints. As shown in [6], this is a general problem of randomly-chosen constraint sets, where some constraints reduce the clustering performance. Thus, a learning metric or an edge weight re-adjustment method, is not always reliable for a small number of constraints.…”
Section: Comparison With Other Techniquesmentioning
confidence: 99%
See 1 more Smart Citation
“…Recently, Wagstaff et al, and Davidson et al attempted to link the quality of constraint sets with clustering algorithm performance (Davidson, Wagstaff et al 2006;Wagsta, Basu et al 2006). Two properties of constraint setinconsistency and incoherence -were shown to be strongly negative correlated with clustering algorithm performance.…”
Section: Introductionmentioning
confidence: 99%