2017 51st Asilomar Conference on Signals, Systems, and Computers 2017
DOI: 10.1109/acssc.2017.8335370
|View full text |Cite
|
Sign up to set email alerts
|

Sketched clustering via hybrid approximate message passing

Abstract: In sketched clustering, a dataset of T samples is first sketched down to a vector of modest size, from which the centroids are subsequently extracted. Advantages include i) reduced storage complexity and ii) centroid extraction complexity independent of T . For the sketching methodology recently proposed by Keriven et al., which can be interpreted as a random sampling of the empirical characteristic function, we propose a sketched clustering algorithm based on approximate message passing. Numerical experiments… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
13
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
3
1
1

Relationship

3
2

Authors

Journals

citations
Cited by 6 publications
(14 citation statements)
references
References 26 publications
(69 reference statements)
1
13
0
Order By: Relevance
“…When n is very large, however, Lloyd's algorithm becomes computationally demanding. Instead, one could sketch the dataset using (1) and extract the centroids from the sketch [11,12]. For this purpose, one could use RF features (3) and (as explained in the sequel) solve for the centroids {c } k =1 and the (non-negative, sum-to-one) weights {α } k =1 that minimize z − k =1 α Φ(c ) .…”
Section: Illustration Using Four Worked Examplesmentioning
confidence: 99%
See 4 more Smart Citations
“…When n is very large, however, Lloyd's algorithm becomes computationally demanding. Instead, one could sketch the dataset using (1) and extract the centroids from the sketch [11,12]. For this purpose, one could use RF features (3) and (as explained in the sequel) solve for the centroids {c } k =1 and the (non-negative, sum-to-one) weights {α } k =1 that minimize z − k =1 α Φ(c ) .…”
Section: Illustration Using Four Worked Examplesmentioning
confidence: 99%
“…In this setting, the weights allow us to model unbalanced clusters, yet only the centroids need to be recovered. On large datasets, this approach can be orders-of-magnitude better than Lloyd's algorithm in memory and runtime, provided that the sketch dimension m is large enough, i.e., that m is on the order of kd, where kd is the number of free parameters in {c } k =1 [11,12]. For example, this method allows us to cluster the MNIST digit dataset, of dimension d = 784 and cardinality n = 70 000, using a complex-valued sketch of dimension m = 400 (see Box 2).…”
Section: Illustration Using Four Worked Examplesmentioning
confidence: 99%
See 3 more Smart Citations