2019
DOI: 10.1609/aaai.v33i01.33014229
Learning Adaptive Random Features

Abstract: Distributed learning and random projections are the most common techniques in large-scale nonparametric statistical learning. In this paper, we study the generalization properties of kernel ridge regression using both distributed methods and random features. Theoretical analysis shows that the combination substantially reduces the computational cost while preserving the optimal generalization accuracy under standard assumptions. In a benign case, O(√N) partitions and O(√N) random features are sufficient to achieve O…
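The combination the abstract describes, divide-and-conquer plus random features for kernel ridge regression, can be sketched as follows: split the N training points into roughly √N partitions, fit a ridge regression in a shared random Fourier feature space on each partition, and average the local solutions. The sketch below is a minimal illustration under these assumptions; the function names (rff_map, distributed_rf_krr), the Gaussian kernel, and the particular regularization and feature counts are illustrative choices, not the paper's actual algorithm or settings.

```python
import numpy as np

def rff_map(X, W, b):
    """Random Fourier features approximating a Gaussian kernel (illustrative)."""
    return np.sqrt(2.0 / W.shape[1]) * np.cos(X @ W + b)

def distributed_rf_krr(X, y, n_partitions, n_features, lam=1e-3, gamma=1.0, seed=0):
    """Fit ridge regression with random features on each partition, then average the local solutions."""
    rng = np.random.default_rng(seed)
    d = X.shape[1]
    # Shared random features so all local models live in the same feature space.
    W = rng.normal(scale=np.sqrt(2 * gamma), size=(d, n_features))
    b = rng.uniform(0, 2 * np.pi, size=n_features)
    coefs = []
    for idx in np.array_split(rng.permutation(len(X)), n_partitions):
        Z = rff_map(X[idx], W, b)                              # n_i x M feature matrix
        A = Z.T @ Z + lam * len(idx) * np.eye(n_features)      # local regularized normal equations
        coefs.append(np.linalg.solve(A, Z.T @ y[idx]))
    w_bar = np.mean(coefs, axis=0)                             # divide-and-conquer averaging
    return lambda X_new: rff_map(X_new, W, b) @ w_bar

# Toy usage: N = 1024 points, so sqrt(N) = 32 partitions and 32 random features.
X = np.random.randn(1024, 5)
y = np.sin(X[:, 0]) + 0.1 * np.random.randn(1024)
predict = distributed_rf_krr(X, y, n_partitions=32, n_features=32)
print(predict(X[:5]))
```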

Cited by 8 publications (6 citation statements)
References 16 publications (51 reference statements)
“…To overcome the computational and memory bottleneck of KRLS, practical algorithms have been developed, including the Nyström approach (Rudi, Carratino, and Rosasco 2017; Camoriano et al. 2016) and divide-and-conquer (Zhang, Duchi, and Wainwright 2013; Li, Liu, and Wang 2019), whose statistical properties are well studied. The Nyström method (Rudi, Camoriano, and Rosasco 2015; Camoriano et al. 2016) constructs small-scale matrices, by sampling the dataset, that approximate the full kernel matrix, so that the time and space complexity drop sharply.…”
Section: Related Work
confidence: 99%
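For readers unfamiliar with the Nyström approach mentioned in the statement above, a minimal sketch follows: sample m landmark points, build only the n×m and m×m kernel blocks, and solve the reduced ridge system. The kernel choice, landmark count, and helper names below (gaussian_kernel, nystrom_krr) are hypothetical illustrations, not the exact estimators of the cited papers.

```python
import numpy as np

def gaussian_kernel(A, B, gamma=1.0):
    """Pairwise Gaussian kernel k(a, b) = exp(-gamma * ||a - b||^2)."""
    sq = np.sum(A**2, 1)[:, None] + np.sum(B**2, 1)[None, :] - 2 * A @ B.T
    return np.exp(-gamma * sq)

def nystrom_krr(X, y, n_landmarks=50, lam=1e-3, gamma=1.0, seed=0):
    """Nyström-approximated kernel ridge regression: only n x m and m x m kernel blocks are formed."""
    rng = np.random.default_rng(seed)
    landmarks = X[rng.choice(len(X), n_landmarks, replace=False)]
    K_nm = gaussian_kernel(X, landmarks, gamma)           # n x m cross-kernel block
    K_mm = gaussian_kernel(landmarks, landmarks, gamma)   # m x m landmark kernel block
    # Reduced ridge system: (K_nm^T K_nm + lam * n * K_mm) alpha = K_nm^T y.
    A = K_nm.T @ K_nm + lam * len(X) * K_mm
    alpha = np.linalg.solve(A, K_nm.T @ y)
    return lambda X_new: gaussian_kernel(X_new, landmarks, gamma) @ alpha

# Toy usage: 2000 points approximated with 100 landmarks.
X = np.random.randn(2000, 5)
y = np.sin(X[:, 0]) + 0.1 * np.random.randn(2000)
predict = nystrom_krr(X, y, n_landmarks=100)
print(predict(X[:5]))
```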
“…In this section, to explore the generalization ability, we first introduce four standard assumptions that are widely used in statistical learning with the squared loss (Smale and Zhou 2007; Caponnetto and Vito 2007; Rudi, Carratino, and Rosasco 2017; Li, Liu, and Wang 2019). Under these assumptions, we provide a theoretical bound for the proposed algorithm that matches that of exact Kernel Regularized Least Squares (KRLS).…”
Section: Theoretical Assessments
confidence: 99%