Approximating K‐means‐type Clustering via Semidefinite Programming

Peng, Jiming; Yu, Wei

doi:10.1137/050641983

Cited by 140 publications

(215 citation statements)

References 23 publications

Supporting

Mentioning

209

Contrasting

Order By: Relevance

“…Modifications of k-means to account for must-link/cannot-link constraints are discussed in [15], distance-type constraints on the cluster centers are discussed in [16], and lower-bound constraints on the number of points per cluster are discussed in [17]. As an alternative to alternating optimization-based k-means, approximation algorithms based on convex (semidefinite) optimization [18] are also known; see, e.g., [19] and the references therein.…”

Section: Paper-to-session Assignmentmentioning

confidence: 99%

Signal Processing and Optimization Tools for Conference Review and Session Assignment

Sidiropoulos

Tsakonas

2015

IEEE Signal Process. Mag.

View full text Add to dashboard Cite

nyone who has served as a technical program committee (TPC) chair for a conference (or program manager for a funding agency) understands that paper (or proposal panel) review assignment is a demanding job that takes a lot of time, and reviewers are rarely satisfied with the end results. This article presents signal processing tools for two critical "mass assignment" tasks: assigning papers (or proposals) to reviewers in a way that matches reviewing expertise to scientific content while respecting the reviewers' capacity constraints and splitting accepted papers (or submitted proposals) to sessions (panels) while adhering to session (panel) capacity constraints. The basic idea is to use feature vectors to represent papers and reviewers. Features can be key words or phrases (e.g., optimization or sensor networks) or other types of attributes (e.g., timeliness). This viewpoint enables optimal assignment problem formulations that make sense from a scientific and practical point of view. While optimal solutions are hard to compute for a large number of papers and

show abstract

Section: Paper-to-session Assignmentmentioning

confidence: 99%

Signal Processing and Optimization Tools for Conference Review and Session Assignment

Sidiropoulos

Tsakonas

2015

IEEE Signal Process. Mag.

View full text Add to dashboard Cite

show abstract

“…Every column of h is on the simplex, h is positive semidefinite, its trace and rank are K. We can then reformulate the problem as minimizing h, w over the set of such matrices. Linear programming and semidefinite programming convex relaxations of this problem have been proposed [33][34][35]. They are less efficient in practice than the convex formulation (12) [35].…”

Section: Prior Work On the K-means Problemmentioning

confidence: 99%

A Convex Approach to K-Means Clustering and Image Segmentation

Condat

2018

Lecture Notes in Computer Science

View full text Add to dashboard Cite

Abstract.A new convex formulation of data clustering and image segmentation is proposed, with fixed number K of regions and possible penalization of the region perimeters. So, this problem is a spatially regularized version of the K-means problem, a.k.a. piecewise constant Mumford-Shah problem. The proposed approach relies on a discretization of the search space; that is, a finite number of candidates must be specified, from which the K centroids are determined. After reformulation as an assignment problem, a convex relaxation is proposed, which involves a kind of l1,∞ norm ball. A splitting of it is proposed, so as to avoid the costly projection onto this set. Some examples illustrate the efficiency of the approach.

show abstract

“…The first equality holds because both sets of constraints bound the eigenvalues of the matrices to be either 0 or 1, with exactly k being 1 [27]. Unfortunately, neither of the first two sets of constraints is convex on M .…”

Section: Lemmamentioning

confidence: 99%

Efficient global optimization for exponential family PCA and low-rank matrix factorization

Guo

Schuurmans

2008

2008 46th Annual Allerton Conference on Communication, Control, and Computing

View full text Add to dashboard Cite

Abstract-We present an efficient global optimization algorithm for exponential family principal component analysis (PCA) and associated low-rank matrix factorization problems. Exponential family PCA has been shown to improve the results of standard PCA on non-Gaussian data. Unfortunately, the widespread use of exponential family PCA has been hampered by the existence of only local optimization procedures. The prevailing assumption has been that the non-convexity of the problem prevents an efficient global optimization approach from being developed. Fortunately, this pessimism is unfounded. We present a reformulation of the underlying optimization problem that preserves the identity of the global solution while admitting an efficient optimization procedure. The algorithm we develop involves only a sub-gradient optimization of a convex objective plus associated eigenvector computations. (No general purpose semidefinite programming solver is required.) The lowrank constraint is exactly preserved, while the method can be kernelized through a consistent approximation to admit a fixed non-linearity. We demonstrate improved solution quality with the global solver, and also add to the evidence that exponential family PCA produces superior results to standard PCA on non-Gaussian data.

show abstract

Approximating K‐means‐type Clustering via Semidefinite Programming

Cited by 140 publications

References 23 publications

Signal Processing and Optimization Tools for Conference Review and Session Assignment

Signal Processing and Optimization Tools for Conference Review and Session Assignment

A Convex Approach to K-Means Clustering and Image Segmentation

Efficient global optimization for exponential family PCA and low-rank matrix factorization

Contact Info

Product

Resources

About