CUR decompositions are interpretable data analysis tools that express a data matrix in terms of a small number of actual columns and/or actual rows of the data matrix. One bottleneck of existing relative-error CUR algorithms lies on high computational complexity for computing important sampling probabilities. In this paper, we provide a simple yet effective framework that considers energy-based sampling algorithm. On one hand, we provide an intuitive and fast relative-error sampling algorithm for column selection problem. On the other hand, by combining the relative-error sampling algorithm with adaptive sampling algorithm we provide a novel CUR matrix approximation algorithms which is referred to as energy-based adaptive sampling algorithm. The sampling algorithm is the first adaptive relative-error CUR decomposition in the coherent sense. Specially, in each stage of our algorithm, we sample columns or rows from data matrix using sampling probabilities that are directly proportional to Euclidean norms of the columns or rows of the original data and residual matrix, respectively. Our empirical results exactly indicate that the new adaptive sampling algorithm typically achieves a good balance between computational complexity and approximate accuracy.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.