Sketching for large-scale learning of mixture models

Keriven, Nicolas; Bourrier, Anthony; Gribonval, Rémi; Pérez, Patrick

doi:10.1093/imaiai/iax015

Cited by 52 publications

(151 citation statements)

References 80 publications

Supporting

Mentioning

151

Contrasting


“…To recover the centroids C from y, the state-of-the-art algorithm is compressed learning via orthogonal matching pursuit with replacement (CL-OMPR) [5,6]. It aims to solve arg min…”

Section: A Sketched Clustering (mentioning)

confidence: 99%

“…That is, $\{x_t\}_{t=1}^{T}$ are assumed to be drawn i.i.d. from the GMM distribution (5). To recover the centroids C [c_1, .…”

Section: A High-dimensional Inference Framework (mentioning)

confidence: 99%

“…As proposed in [5], the row-norms $\{g_m\}$ from (8) were drawn i.i.d. from the distribution (20) with shape parameter $\sigma^2$.…”

Section: A High-dimensional Inference Framework (mentioning)

confidence: 99%


Sketched clustering via hybrid approximate message passing

Byrne

Gribonval

Schniter

2017

2017 51st Asilomar Conference on Signals, Systems, and Computers


In sketched clustering, a dataset of T samples is first sketched down to a vector of modest size, from which the centroids are subsequently extracted. Advantages include i) reduced storage complexity and ii) centroid extraction complexity independent of T. For the sketching methodology recently proposed by Keriven et al., which can be interpreted as a random sampling of the empirical characteristic function, we propose a sketched clustering algorithm based on approximate message passing. Numerical experiments suggest that our approach is more efficient than the state-of-the-art sketched clustering algorithm "CL-OMPR" (in both computational and sample complexity) and more efficient than k-means++ when T is large.

Index Terms: clustering algorithms, data compression, compressed sensing, approximate message passing

* E. Byrne (byrne.133@osu.edu) and P. Schniter (schniter.1@osu.edu) are with the…
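The abstract's description of the sketch as a random sampling of the empirical characteristic function can be illustrated with a minimal NumPy sketch. The function name `compute_sketch`, the Gaussian draw of the frequency vectors, the sign convention in the exponent, and the toy dimensions are all assumptions made for illustration; they are not taken from the cited papers.

```python
import numpy as np

def compute_sketch(X, W):
    """Sample the empirical characteristic function of X at the rows of W.

    X: (T, n) array of data samples.
    W: (m, n) array of frequency vectors (hypothetical choice, see below).
    Returns an (m,) complex vector y with y[j] = mean_t exp(1j * <W[j], X[t]>).
    """
    # (T, m) matrix of projections <w_j, x_t>, then average over the T samples.
    return np.exp(1j * (X @ W.T)).mean(axis=0)

rng = np.random.default_rng(0)
T, n, m = 10_000, 2, 20          # toy sizes, chosen only for the example
X = rng.normal(size=(T, n))      # stand-in dataset: standard Gaussian samples
W = rng.normal(size=(m, n))      # assumption: frequencies drawn i.i.d. Gaussian
y = compute_sketch(X, W)         # sketch of size m, independent of T

# Because the sketch is an average, sketches of two equal-sized chunks
# combine into the sketch of the full dataset by simple averaging.
y_merged = 0.5 * (compute_sketch(X[: T // 2], W) + compute_sketch(X[T // 2 :], W))
```

The sketch has length m regardless of T, which is what makes the storage cost and the subsequent centroid extraction independent of the number of samples.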

“…Note that this type of error is often considered when introducing quantization [7,19]. Ideally, one has η = 0, however in some cases it can be considerably simpler to prove that the LRIP holds with a non-zero η [20]. The reader would note that the classical RIP is often expressed with a constant $\alpha = (1 - t)^{-1}$ where t < 1 is a small as possible.…”

Section: Deterministic Operator (mentioning)

confidence: 99%

Instance Optimal Decoding and the Restricted Isometry Property

Keriven

Gribonval

2018

J. Phys.: Conf. Ser.

Self Cite


In this paper, we address the question of information preservation in ill-posed, non-linear inverse problems, assuming that the measured data is close to a low-dimensional model set. We provide necessary and sufficient conditions for the existence of a so-called instance optimal decoder, i.e., one that is robust to noise and modelling error. Inspired by existing results in compressive sensing, our analysis is based on a (Lower) Restricted Isometry Property (LRIP), formulated in a non-linear fashion. We also provide sufficient conditions for non-uniform recovery with random measurement operators, with a new formulation of the LRIP. We finish by describing typical strategies to prove the LRIP in both linear and non-linear cases, and illustrate our results by studying the invertibility of a one-layer neural net with random weights.

1 A pseudometric d satisfies all the requirements of a metric except d(x, y) = 0 ⇒ x = y.

2 Similarly, a seminorm satisfies the requirements of a norm except that ‖x‖ = 0 does not imply x = 0.
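For context on the constant α = (1 − t)^{-1} quoted in the citation statement for this entry, the following is a short worked step under one common statement of the classical RIP with squared Euclidean norms. The squared-norm convention and the symbols A (a linear measurement matrix) and x (a vector in the model set) are assumptions for illustration; the quoted text does not spell them out, and this corresponds to the ideal case η = 0.

```latex
\[
(1-t)\,\|x\|_2^2 \;\le\; \|Ax\|_2^2
\quad\Longrightarrow\quad
\|x\|_2^2 \;\le\; \alpha\,\|Ax\|_2^2,
\qquad \alpha = (1-t)^{-1},\quad 0 \le t < 1 .
\]
```

Read this way, only the lower inequality of the RIP is needed to bound x by its measurements, and α approaches 1 as t approaches 0.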


“…Interestingly, κ also defines a Reproducing Kernel Hilbert Space in which two probability density functions (pdfs) can be compared with a Maximum Mean Discrepancy (MMD) metric [22][23][24][25][26]. Equipped with the MMD metric, the Generalized Method of Moments [27] in (3) is equivalent to an infinite-dimensional Compressed Sensing [11] problem, where the "sparse" pdf underlying the data (e.g., approximated by few Diracs) is reconstructed from a small number of compressive, random linear pdf measurements: the sketch [26]. The method to solve (3) is thus inspired by the OMP(R) CS recovery algorithm, i.e., Orthogonal Matching Pursuit (with Replacement) [28,29].…”

Section: Introduction (mentioning)

confidence: 99%

Quantized Compressive K-Means

Schellekens

Jacques

2018

IEEE Signal Process. Lett.


The recent framework of compressive statistical learning proposes to design tractable learning algorithms that use only a heavily compressed representation, or sketch, of massive datasets. Compressive K-Means (CKM) is such a method: it aims at estimating the centroids of data clusters from pooled, non-linear, random signatures of the learning examples. While this approach significantly reduces computational time on very large datasets, its digital implementation wastes acquisition resources because the learning examples are compressed only after the sensing stage. The present work generalizes the CKM sketching procedure to a large class of periodic nonlinearities including hardware-friendly implementations that compressively acquire entire datasets. This idea is exemplified in a Quantized Compressive K-Means procedure, a variant of CKM that leverages 1-bit universal quantization (i.e., retaining the least significant bit of a standard uniform quantizer) as the periodic sketch nonlinearity. Trading for this resource-efficient signature (standard in most acquisition schemes) has almost no impact on the clustering performance, as illustrated by numerical experiments.
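The 1-bit universal quantization mentioned in the abstract (keeping the least significant bit of a uniform quantizer) can be sketched as follows. This is an illustrative reading only: the function names, the step size `delta`, the mapping of the bit to ±1, and the random dither `xi` added before quantization are assumptions, not the authors' stated implementation.

```python
import numpy as np

def universal_quantize(t, delta=1.0):
    """Least significant bit of a uniform quantizer with step delta, as +/-1.

    The result is periodic in t with period 2 * delta (a square wave).
    """
    bits = np.floor(np.asarray(t) / delta).astype(np.int64) % 2  # LSB in {0, 1}
    return 1 - 2 * bits                                          # 0 -> +1, 1 -> -1

def quantized_sketch(X, W, xi, delta=1.0):
    """Pool the 1-bit signatures of all samples into one real-valued sketch.

    X: (T, n) data, W: (m, n) projection vectors, xi: (m,) dither (assumed).
    """
    return universal_quantize(X @ W.T + xi, delta).mean(axis=0)

rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 2))            # toy dataset
W = rng.normal(size=(16, 2))              # assumption: Gaussian projections
xi = rng.uniform(0.0, 2.0, size=16)       # assumption: dither over one period
z = quantized_sketch(X, W, xi)            # (16,) vector with entries in [-1, 1]

demo = universal_quantize(np.array([0.25, 1.5, -0.5, 2.75]))
```

Each sample contributes only one bit per projection, and the pooled average plays the same role as the complex exponential signature in the unquantized sketch.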
