2010 IEEE 51st Annual Symposium on Foundations of Computer Science
DOI: 10.1109/focs.2010.38

Efficient Volume Sampling for Row/Column Subset Selection

Abstract: We give efficient algorithms for volume sampling, i.e., for picking k-subsets of the rows of any given matrix with probabilities proportional to the squared volumes of the simplices defined by them and the origin (or the squared volumes of the parallelepipeds defined by these subsets of rows). This solves an open problem from the monograph on spectral algorithms by Kannan and Vempala (see Section 7.4 of [15], also implicit in [1,5]). Our first algorithm for volume sampling k-subsets of rows from an m-by-n matri…
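The distribution described above is easy to state directly, even though sampling from it efficiently is the paper's contribution: a k-subset S of rows of A is picked with probability proportional to det(A_S A_S^T), the squared volume of the parallelepiped spanned by those rows. The sketch below is a brute-force reference implementation of that definition (it enumerates all k-subsets, so it only runs for tiny m); the function name and structure are illustrative, not the paper's algorithm.

```python
import itertools
import numpy as np

def volume_sample(A, k, rng=None):
    """Draw a k-subset of the rows of A with probability proportional to
    det(A_S A_S^T), the squared volume of the parallelepiped spanned by
    the rows indexed by S. Brute force over all C(m, k) subsets."""
    rng = np.random.default_rng() if rng is None else rng
    m = A.shape[0]
    subsets = list(itertools.combinations(range(m), k))
    # Squared k-dimensional volume of each candidate row subset.
    weights = np.array([np.linalg.det(A[list(S)] @ A[list(S)].T)
                        for S in subsets])
    weights = np.clip(weights, 0.0, None)  # guard against round-off negatives
    probs = weights / weights.sum()
    return subsets[rng.choice(len(subsets), p=probs)]

# Example: sample a 2-subset of the rows of a random 6-by-4 matrix.
A = np.random.default_rng(0).standard_normal((6, 4))
print(volume_sample(A, k=2))
```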

Cited by 158 publications (224 citation statements). References 19 publications.
“…We show that O(k/ε) columns contain a rank-k subspace which reconstructs A to relative error, and we present the first sub-SVD (in terms of running time) randomized algorithm to identify these columns. This matches the Ω(k/ε) lower bound in [8] and improves the best known upper bound of O(k log k + k/ε) [6,8,12,23]. …”
supporting
confidence: 63%
“…There is considerable interest (e.g. [4,6,8,9,11,15,19,20,21]) in determining a minimum set of r ≪ n columns of A which is approximately as good as A_k at reconstructing A. Such columns are important for interpreting data [21], building robust machine learning algorithms [4], feature selection, etc.…”
mentioning
confidence: 99%
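The benchmark behind this statement is concrete: a chosen column subset C is judged by how close the projection error ‖A − CC⁺A‖_F comes to the best rank-k error ‖A − A_k‖_F achieved by the truncated SVD. A minimal sketch of that comparison (NumPy only; names are illustrative):

```python
import numpy as np

def reconstruction_errors(A, cols, k):
    """Frobenius error of projecting A onto the span of the chosen
    columns, alongside the best possible rank-k error ||A - A_k||_F."""
    C = A[:, cols]
    # Project A onto the column span of C via the pseudoinverse.
    err_cols = np.linalg.norm(A - C @ np.linalg.pinv(C) @ A, "fro")
    # Best rank-k error comes from the trailing singular values.
    s = np.linalg.svd(A, compute_uv=False)
    err_best = np.sqrt(np.sum(s[k:] ** 2))
    return err_cols, err_best

A = np.random.default_rng(1).standard_normal((50, 20))
print(reconstruction_errors(A, cols=[0, 3, 7], k=3))
```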
“…Thus, approximate algorithms with lower computational complexity have been presented in the relevant literature, with the goal of finding a suboptimal but acceptable solution. The proposed approaches include randomized, deterministic or hybrid methods, using SVD sparse approximation [16], random selection of matrix columns, based on a probability distribution, which is later refined deterministically [10], greedy recursive computation of the reconstruction error, initialized with random projections of the matrix columns [17], column subset selection with probabilities proportional to the squared volumes of the parallelepipeds defined by these subsets [18], etc.…”
Section: Multimodal Shot Pruning (MSP)
mentioning
confidence: 99%
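One of the approaches surveyed in this statement, greedy selection driven by the reconstruction error, is short enough to sketch. The baseline below adds, at each step, the column that most reduces the Frobenius residual; it is an illustrative version of the general idea, not the specific algorithm of the cited [17]:

```python
import numpy as np

def greedy_column_select(A, r):
    """Greedy column subset selection: at each step, add the column
    that minimizes the residual ||A - C pinv(C) A||_F. Illustrative
    baseline; cost is O(r * n) pseudoinverse projections."""
    n = A.shape[1]
    chosen = []
    for _ in range(r):
        best_j, best_err = None, np.inf
        for j in range(n):
            if j in chosen:
                continue
            C = A[:, chosen + [j]]
            err = np.linalg.norm(A - C @ np.linalg.pinv(C) @ A, "fro")
            if err < best_err:
                best_j, best_err = j, err
        chosen.append(best_j)
    return chosen

A = np.random.default_rng(2).standard_normal((30, 12))
print(greedy_column_select(A, r=4))
```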
“…These include the problem of sampling from a DPP, computing its partition function and the MAP-inference problem which asks to find the set of highest probability (or equivalently to find the largest coefficient of q(x)). For the case of unconstrained DPPs, problems (1) and (2) are quite well understood, and various solutions have been proposed [Kha95,DR10,AGR16,Nik15,SEFM15]. Recently, the case of constrained DPPs, when the support is restricted to a combinatorial family B ⊆ 2^[m], has been studied [NS16,SV16,CDKV16] with machine learning applications in mind; however, very little is known computationally.…”
Section: Introduction
mentioning
confidence: 99%
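For context on the partition function mentioned in this statement: for an L-ensemble DPP with PSD kernel L, the normalizer Σ_{S⊆[m]} det(L_S) has the closed form det(L + I), which is what makes it computable in the unconstrained case. A small numerical check of that identity (illustrative code, not from the cited works):

```python
import itertools
import numpy as np

# Identity behind the DPP partition function:
#   sum over all subsets S of det(L_S)  ==  det(L + I).
rng = np.random.default_rng(3)
B = rng.standard_normal((5, 5))
L = B @ B.T  # a PSD kernel for an L-ensemble DPP
m = L.shape[0]

brute = 1.0  # contribution of the empty set (empty determinant is 1)
for r in range(1, m + 1):
    for S in itertools.combinations(range(m), r):
        brute += np.linalg.det(L[np.ix_(S, S)])

closed = np.linalg.det(L + np.eye(m))
print(brute, closed)  # agree up to floating-point error
```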