2007
DOI: 10.1137/050641983
|View full text |Cite
|
Sign up to set email alerts
|

Approximating K‐means‐type Clustering via Semidefinite Programming

Abstract: One of the fundamental clustering problems is to assign n points into k clusters based on the minimal sum-of-squares(MSSC), which is known to be NP-hard. In this paper, by using matrix arguments, we first model MSSC as a so-called 0-1 semidefinite programming (SDP). We show that our 0-1 SDP model provides an unified framework for several clustering approaches such as normalized k-cut and spectral clustering. Moreover, the 0-1 SDP model allows us to solve the underlying problem approximately via the relaxed lin… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

6
209
0

Year Published

2008
2008
2023
2023

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 140 publications
(215 citation statements)
references
References 23 publications
6
209
0
Order By: Relevance
“…Modifications of k-means to account for must-link/cannot-link constraints are discussed in [15], distance-type constraints on the cluster centers are discussed in [16], and lower-bound constraints on the number of points per cluster are discussed in [17]. As an alternative to alternating optimization-based k-means, approximation algorithms based on convex (semidefinite) optimization [18] are also known; see, e.g., [19] and the references therein.…”
Section: Paper-to-session Assignmentmentioning
confidence: 99%
“…Modifications of k-means to account for must-link/cannot-link constraints are discussed in [15], distance-type constraints on the cluster centers are discussed in [16], and lower-bound constraints on the number of points per cluster are discussed in [17]. As an alternative to alternating optimization-based k-means, approximation algorithms based on convex (semidefinite) optimization [18] are also known; see, e.g., [19] and the references therein.…”
Section: Paper-to-session Assignmentmentioning
confidence: 99%
“…Every column of h is on the simplex, h is positive semidefinite, its trace and rank are K. We can then reformulate the problem as minimizing h, w over the set of such matrices. Linear programming and semidefinite programming convex relaxations of this problem have been proposed [33][34][35]. They are less efficient in practice than the convex formulation (12) [35].…”
Section: Prior Work On the K-means Problemmentioning
confidence: 99%
“…The first equality holds because both sets of constraints bound the eigenvalues of the matrices to be either 0 or 1, with exactly k being 1 [27]. Unfortunately, neither of the first two sets of constraints is convex on M .…”
Section: Lemmamentioning
confidence: 99%