2019 57th Annual Allerton Conference on Communication, Control, and Computing (Allerton)
DOI: 10.1109/allerton.2019.8919853

Nested Distributed Gradient Methods with Stochastic Computation Errors

Abstract: In this work, we consider the problem of a network of agents collectively minimizing a sum of convex functions. The agents in our setting can only access their local objective functions and exchange information with their immediate neighbors. Motivated by applications where computation is imperfect, including, but not limited to, empirical risk minimization (ERM) and online learning, we assume that only noisy estimates of the local gradients are available. To tackle this problem, we adapt a class of Nested Dis…
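The abstract describes decentralized consensus optimization: n agents jointly minimize (1/n) * sum_i f_i(x), where each agent i can evaluate only noisy estimates of its own gradient and exchange information with immediate neighbors. The sketch below illustrates the general nested pattern referenced here (one noisy local gradient step followed by several rounds of neighbor averaging). The quadratic objectives, the ring-graph mixing matrix, and the parameters alpha and t_comm are assumptions chosen for illustration; this is not the authors' algorithm or code.

```python
import numpy as np

rng = np.random.default_rng(0)
n, d = 5, 3                       # number of agents, problem dimension

# Synthetic local objectives f_i(x) = 0.5 * ||x - b_i||^2 (assumed for illustration only).
B = rng.normal(size=(n, d))
x_star = B.mean(axis=0)           # minimizer of the average objective

def noisy_grad(i, x, sigma=0.1):
    """Unbiased stochastic estimate of grad f_i(x) = x - b_i."""
    return (x - B[i]) + sigma * rng.normal(size=d)

# Doubly stochastic mixing matrix W for a ring graph.
W = np.zeros((n, n))
for i in range(n):
    W[i, i] = 0.5
    W[i, (i - 1) % n] = 0.25
    W[i, (i + 1) % n] = 0.25

alpha, t_comm = 0.1, 3            # step size and consensus rounds per iteration (hypothetical values)
X = rng.normal(size=(n, d))       # one local iterate per agent (one row each)

for k in range(200):
    # Computation step: each agent moves along its own noisy local gradient.
    Y = X - alpha * np.array([noisy_grad(i, X[i]) for i in range(n)])
    # Communication step: t_comm rounds of averaging with immediate neighbors.
    for _ in range(t_comm):
        Y = W @ Y
    X = Y

print("distance of average iterate to optimum:", np.linalg.norm(X.mean(axis=0) - x_star))
```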

Cited by 2 publications (3 citation statements) | References 48 publications
“…Our method is based on a class of flexible algorithms (NEAR-DGD) [57] that permit the trade-off of computation and communication to best accommodate the application setting. In this work, we generalize our previous results analyzing NEAR-DGD in the presence of either deterministically quantized communication [58] or stochastic gradient errors [59], and unify them under a common, fully stochastic framework. We provide theoretical results to demonstrate that S-NEAR-DGD converges to a neighborhood of the optimal solution with geometric rate, and that if an error-correction mechanism is incorporated into the consensus step, then the total communication error induced by inexact communication is independent of the number of consensus rounds performed by our algorithm.…”
Section: B. Contributions (supporting)
confidence: 56%
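The excerpt above states that S-NEAR-DGD converges to a neighborhood of the optimal solution at a geometric rate. A generic template of such a bound (illustrative only, with placeholder constants, not the paper's exact theorem) is:

```latex
% Illustrative "geometric rate to a neighborhood" bound; c, rho, sigma are placeholder constants.
\| \bar{x}_k - x^* \|^2 \;\le\; c \, \rho^{k} \, \| \bar{x}_0 - x^* \|^2 \;+\; \mathcal{O}(\sigma^2),
\qquad \rho \in (0, 1),
```

where \bar{x}_k denotes the (average) iterate at step k, \rho typically depends on the step size, the objective's conditioning, and the network connectivity, and the O(sigma^2) term is the error floor induced by the stochastic computation or communication errors.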
“…We considered quantized communication using deterministic (D) algorithms (e.g. rounding to the nearest integer with no uncertainty) in [58], while a variant of NEAR-DGD that utilizes stochastic gradient approximations only was presented in [59]. This work unifies and generalizes these methods.…”
Section: Methods, Communication, Computation (mentioning)
confidence: 99%
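The excerpt contrasts deterministic quantization of communicated values (e.g., rounding to a fixed grid with no randomness) with stochastic gradient approximations. A minimal sketch of such a deterministic quantizer, with a hypothetical grid spacing delta, might look like:

```python
import numpy as np

def quantize_deterministic(x, delta=0.01):
    """Round every coordinate to the nearest multiple of delta (no randomness involved)."""
    return delta * np.round(np.asarray(x, dtype=float) / delta)

# Example: a local iterate an agent would transmit to its neighbors before a consensus round.
v = np.array([0.1234, -0.9876, 0.4242])
print(quantize_deterministic(v))
```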
“…1. distributed first-order primal algorithms [15][16][17][18][19][20][21][22][23][24]: methods that use only gradient information and operate in primal space (i.e., directly on problem (1.3)); 2. nested [1], [25][26][27][28]: methods that decompose the communication and computation steps and perform them sequentially;…”
Section: Introduction (mentioning)
confidence: 99%