2014
DOI: 10.1109/tsp.2013.2291221

Convergence Rates of Distributed Nesterov-Like Gradient Methods on Random Networks

Abstract: We consider distributed optimization in random networks where nodes cooperatively minimize the sum of their individual convex costs. Existing literature proposes distributed gradient-like methods that are computationally cheap and resilient to link failures, but have slow convergence rates. In this paper, we propose accelerated distributed gradient methods that 1) are resilient to link failures; 2) are computationally cheap; and 3) improve convergence rates over other gradient methods. We model the networ…
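The abstract describes a consensus-plus-gradient update with Nesterov-style momentum over a randomly varying network. The following is a minimal, illustrative sketch of that pattern, not the paper's exact algorithm: the weight-matrix construction, the link-failure probability p_link, the step-size constant c, and the momentum coefficient are assumptions made for the example.

```python
import numpy as np

def distributed_nesterov_sketch(grad_fns, n_nodes, dim, n_iters, c=1.0, p_link=0.8, seed=0):
    """Illustrative distributed Nesterov-like gradient iteration on a random network.

    Each iteration draws an i.i.d. random graph (modeling link failures),
    mixes the nodes' auxiliary iterates with a symmetric, doubly stochastic
    weight matrix, takes a local gradient step with a diminishing step size,
    and applies a Nesterov-style momentum update. All constants here are
    illustrative assumptions.
    """
    rng = np.random.default_rng(seed)
    x = np.zeros((n_nodes, dim))   # solution estimates, one row per node
    y = x.copy()                   # auxiliary (momentum) iterates
    for k in range(n_iters):
        # Random network: each undirected link is active independently with prob. p_link.
        up = np.triu(rng.random((n_nodes, n_nodes)) < p_link, 1)
        adj = up | up.T
        # Simple symmetric weights (equal off-diagonal weight; an illustrative choice).
        W = np.where(adj, 1.0 / n_nodes, 0.0)
        np.fill_diagonal(W, 1.0 - W.sum(axis=1))
        alpha = c / (k + 1)                                  # diminishing step size
        grads = np.array([grad_fns[i](y[i]) for i in range(n_nodes)])
        x_new = W @ y - alpha * grads                        # consensus + local gradient step
        y = x_new + (k / (k + 3.0)) * (x_new - x)            # Nesterov-like momentum
        x = x_new
    return x
```

For instance, with quadratic local costs f_i(x) = ½‖x − b_i‖², one can pass grad_fns[i] = lambda z, b=b_i: z - b and the nodes' estimates move toward the average of the b_i despite the randomly failing links.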

Cited by 44 publications (46 citation statements)
References: 38 publications
“…We can modify our methods and relax these prior-knowledge requirements such that the methods still provably converge, at rates that are close to the ones presented in this paper; for details, we refer to [10].…”
Section: Algorithm mD-NC
confidence: 92%
“…In subsequent results, ξ denotes an arbitrarily small positive number. A proof of Theorem 1, as well as explicit constants in the established rates, can be found in [10]. Theorem 1 indicates that the convergence rates do not depend on the underlying random network statistics.…”
Section: Convergence Analysis
confidence: 93%
“…We consider standard distributed stochastic gradient methods where at each time step, each node makes a weighted average of its own and its neighbors' solution estimates, and performs a step in the negative direction of its noisy local gradient. The underlying network is allowed to be randomly varying, similarly to, e.g., the models in [4]- [6]. More specifically, the network is modeled through a sequence of independent identically distributed (i.i.d.)…”
Section: Introduction
confidence: 99%
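The update pattern described in that snippet, a weighted average of a node's own and its neighbors' estimates followed by a step along the negative direction of the noisy local gradient, can be written in a few lines. This is a hedged sketch under assumptions; the function name, the Gaussian noise model, and the way W is supplied are illustrative, not the cited paper's exact method.

```python
import numpy as np

def distributed_sgd_step(x, W, grad_fns, alpha, noise_std, rng):
    """One distributed stochastic gradient step: mix the estimates with the
    (possibly random) weight matrix W, then move each node along the negative
    direction of its noisy local gradient. Illustrative sketch only."""
    grads = np.array([grad_fns[i](x[i]) for i in range(x.shape[0])])
    noisy = grads + noise_std * rng.standard_normal(grads.shape)
    return W @ x - alpha * noisy
```

In the randomly varying network model mentioned above, a fresh weight matrix W would be drawn i.i.d. at every step before calling this update.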