Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 2016
DOI: 10.1145/2939672.2939800
|View full text |Cite
|
Sign up to set email alerts
|

Efficient Frequent Directions Algorithm for Sparse Matrices

Abstract: This paper describes Sparse Frequent Directions, a variant of Frequent Directions for sketching sparse matrices. It resembles the original algorithm in many ways: both receive the rows of an input matrix A n×d one by one in the streaming setting and compute a small sketch B ∈ R ×d . Both share the same strong (provably optimal) asymptotic guarantees with respect to the spaceaccuracy tradeoff in the streaming setting. However, unlike Frequent Directions which runs in O(nd ) time regardless of the sparsity of th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
25
0

Year Published

2016
2016
2021
2021

Publication Types

Select...
6
3

Relationship

0
9

Authors

Journals

citations
Cited by 24 publications
(26 citation statements)
references
References 45 publications
(58 reference statements)
1
25
0
Order By: Relevance
“…Fiber subset selection, also called tensor cross approximation (TCA), finds a small subset of fibers which approximates the entire data tensor. For the matrix case, this problem is known as the Column/Row Subset Selection or CUR Problem which has been thoroughly investigated and for which there exist several algorithms with almost matching lower bounds [64,82,140].…”
Section: Tensor Sketching Using Tucker Modelmentioning
confidence: 99%
“…Fiber subset selection, also called tensor cross approximation (TCA), finds a small subset of fibers which approximates the entire data tensor. For the matrix case, this problem is known as the Column/Row Subset Selection or CUR Problem which has been thoroughly investigated and for which there exist several algorithms with almost matching lower bounds [64,82,140].…”
Section: Tensor Sketching Using Tucker Modelmentioning
confidence: 99%
“…Graph sketches [Ahn et al 2012;Liberty 2013;Ghashami et al 2016], or data synopses obtained by applying linear projections, are also relevant. Graph sketching can be viewed as linear dimensionality reduction, where the linearity of sketches makes them applicable to the analysis of streaming graphs with node and edge additions and deletions and distributed settings, such as MapReduce [Dean and Ghemawat 2004].…”
Section: Simplification-based Methodsmentioning
confidence: 99%
“…In summary, these techniques estimate the properties of the original graph, estimate relative frequencies of its substructures and then create a small sample subgraph that resembles the original graph. Also, there are techniques [10,22] that use linear dimensionality reduction on the complex graph to generate simplified graph sketches or data synopses. Grouping-based methods.…”
Section: Related Workmentioning
confidence: 99%