Compressive K-means

Keriven, Nicolas; Tremblay, Nicolas; Traonmilin, Yann; Gribonval, Rémi

doi:10.1109/icassp.2017.7953382

Cited by 37 publications

(79 citation statements)

References 24 publications

(46 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…To recover the centroids C from y, the state-of-the-art algorithm is compressed learning via orthogonal matching pursuit with replacement (CL-OMPR) [5,6]. It aims to solve arg min…”

Section: A Sketched Clusteringmentioning

confidence: 99%

Sketched clustering via hybrid approximate message passing

Byrne

Gribonval

Schniter

2017

2017 51st Asilomar Conference on Signals, Systems, and Computers

Self Cite

View full text Add to dashboard Cite

In sketched clustering, a dataset of T samples is first sketched down to a vector of modest size, from which the centroids are subsequently extracted. Advantages include i) reduced storage complexity and ii) centroid extraction complexity independent of T . For the sketching methodology recently proposed by Keriven et al., which can be interpreted as a random sampling of the empirical characteristic function, we propose a sketched clustering algorithm based on approximate message passing. Numerical experiments suggest that our approach is more efficient than the state-of-the-art sketched clustering algorithm "CL-OMPR" (in both computational and sample complexity) and more efficient than k-means++ when T is large.Index Terms-clustering algorithms, data compression, compressed sensing, approximate message passing * E. Byrne (byrne.133@osu.edu) and P. Schniter (schniter.1@osu.edu) are with the

show abstract

“…To recover the centroids C from y, the state-of-the-art algorithm is compressed learning via orthogonal matching pursuit with replacement (CL-OMPR) [5,6]. It aims to solve arg min…”

Section: A Sketched Clusteringmentioning

confidence: 99%

Sketched clustering via hybrid approximate message passing

Byrne

Gribonval

Schniter

2017

2017 51st Asilomar Conference on Signals, Systems, and Computers

Self Cite

View full text Add to dashboard Cite

show abstract

“…Estimating such signals from a finite number of Fourier measurements is known as the super-resolution problem [9]. Also, the estimation of spikes from random Fourier measurements is at the core of the compressive K-means algorithm were k-means cluster centers are estimated from a compressed database [21]. In the space M of finite signed measure over R d , we aim at recovering x 0 = i=1,k a i δ t i from the measurements…”

Section: Introduction 1contextmentioning

confidence: 99%

The basins of attraction of the global minimizers of the non-convex sparse spike estimation problem

Traonmilin

Aujol

2020

Inverse Problems

Self Cite

View full text Add to dashboard Cite

The sparse spike estimation problem consists in estimating a number of off-thegrid impulsive sources from under-determined linear measurements. Information theoretic results ensure that the minimization of a non-convex functional is able to recover the spikes for adequately chosen measurements (deterministic or random). To solve this problem, methods inspired from the case of finite dimensional sparse estimation where a convex program is used have been proposed. Also greedy heuristics have shown nice practical results. However, little is known on the ideal non-convex minimization method. In this article, we study the shape of the global minimum of this non-convex functional: we give an explicit basin of attraction of the global minimum that shows that the non-convex problem becomes easier as the number of measurements grows. This has important consequences for methods involving descent algorithms (such as the greedy heuristic) and it gives insights for potential improvements of such descent methods.

show abstract

“…The sketch is thus the pooling (average) of random projections of the data samples after passing through a nonlinear, periodic signature-the complex exponential. The Compressive K-Means (CKM) method [8] clusters X from z X by replacing (1) with a sketch matching optimization problem:…”

Section: Introductionmentioning

confidence: 99%

“…For the related kernel K-means problem, [20] uses Random Fourier Features [21], i.e., the low-dimensional mapping z (·) defined in (2). For u, v ∈ R n , the inner product z u , z v approximates a shift-invariant kernel κ(u, v) associated with the frequency distribution Λ. CKM [8] actually averages individual RFF of data points. Interestingly, κ also defines a Reproducing Kernel Hilbert Space in which two probability density functions (pdfs) can be compared with a Maximum Mean Discrepancy (MMD) metric [22][23][24][25][26].…”

Section: Introductionmentioning

confidence: 99%

“…as announced in (3). This non-convex problem is hard to solve exactly, but the CKM algorithm (based on OMPR) [8], detailed in pseudocode below, seeks an approximate solution. More precisely, CKM greedily selects new centroids minimizing a residual r ∈ C m (Steps 1 and 2) inside a box with lower and upper bounds l, u ∈ R n , respectively, enclosing the data X, and eventually replacing bad centroids in Step 3.…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Quantized Compressive K-Means

Schellekens

Jacques

2018

IEEE Signal Process. Lett.

View full text Add to dashboard Cite

The recent framework of compressive statistical learning proposes to design tractable learning algorithms that use only a heavily compressed representation-or sketch-of massive datasets. Compressive K-Means (CKM) is such a method: it aims at estimating the centroids of data clusters from pooled, non-linear, random signatures of the learning examples. While this approach significantly reduces computational time on very large datasets, its digital implementation wastes acquisition resources because the learning examples are compressed only after the sensing stage.The present work generalizes the CKM sketching procedure to a large class of periodic nonlinearities including hardware-friendly implementations that compressively acquire entire datasets. This idea is exemplified in a Quantized Compressive K-Means procedure, a variant of CKM that leverages 1-bit universal quantization (i.e., retaining the least significant bit of a standard uniform quantizer) as the periodic sketch nonlinearity. Trading for this resource-efficient signature (standard in most acquisition schemes) has almost no impact on the clustering performance, as illustrated by numerical experiments. *

show abstract

Compressive K-means

Cited by 37 publications

References 24 publications

Sketched clustering via hybrid approximate message passing

Sketched clustering via hybrid approximate message passing

The basins of attraction of the global minimizers of the non-convex sparse spike estimation problem

Quantized Compressive K-Means

Contact Info

Product

Resources

About