Scaling Stratified Stochastic Gradient Descent for Distributed Matrix Completion

Distributed asynchronous stochastic gradient descent (ASGD) algorithms that approximate low-rank matrix factorizations for collaborative filtering perform one or more synchronizations per epoch where staleness is reduced with more synchronizations. However, high number of synchronizations would prohibit the scalability of the algorithm. We propose a parallel ASGD algorithm, η-PASGD, for efficiently handling η synchronizations per epoch in a scalable fashion. The proposed algorithm puts an upper limit of K on η, for a K-processor system, such that performing η = K synchronizations per epoch would eliminate the staleness completely. The rating data used in collaborative filtering are usually represented as sparse matrices. The sparsity allows for reduction in the staleness and communication overhead combinatorially via intelligently distributing the data to processors. We analyze the staleness and the total volume incurred during an epoch of η-PASGD. Following this analysis, we propose a hypergraph partitioning model to encapsulate reducing staleness and volume while minimizing the maximum number of synchronizations required for a stale-free SGD. This encapsulation is achieved with a novel cutsize metric that is realized via a new recursive-bipartitioning-based algorithm. Experiments on up to 512 processors show the importance of the proposed partitioning method in improving staleness, volume, RMSE and parallel runtime.

show abstract

Minimizing Staleness and Communication Overhead in Distributed SGD for Collaborative Filtering

Abubaker

Caglayan

Karsavuran

et al. 2023

IEEE Trans. Comput.

Self Cite

View full text Add to dashboard Cite

show abstract

Leveraged Matrix Completion With Noise

Huang,

Liu,

et al. 2024

IEEE Trans. Cybern.

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Scaling Stratified Stochastic Gradient Descent for Distributed Matrix Completion

Cited by 2 publications

References 21 publications

Minimizing Staleness and Communication Overhead in Distributed SGD for Collaborative Filtering

Minimizing Staleness and Communication Overhead in Distributed SGD for Collaborative Filtering

Leveraged Matrix Completion With Noise

Contact Info

Product

Resources

About