Alex Olshevsky scite author profile

This paper considers the problem of distributed optimization over time-varying graphs. For the case of undirected graphs, we introduce a distributed algorithm, referred to as DIGing, based on a combination of a distributed inexact gradient method and a gradient tracking technique. The DIGing algorithm uses doubly stochastic mixing matrices and employs fixed step-sizes and, yet, drives all the agents' iterates to a global and consensual minimizer. When the graphs are directed, in which case the implementation of doubly stochastic mixing matrices is unrealistic, we construct an algorithm that incorporates the push-sum protocol into the DIGing structure, thus obtaining Push-DIGing algorithm. The Push-DIGing uses column stochastic matrices and fixed step-sizes, but it still converges to a global and consensual minimizer. Under the strong convexity assumption, we prove that the algorithms converge at R-linear (geometric) rates as long as the step-sizes do not exceed some upper bounds. We establish explicit estimates for the convergence rates. When the graph is undirected it shows that DIGing scales polynomially in the number of agents. We also provide some numerical experiments to demonstrate the efficacy of the proposed algorithms and to validate our theoretical findings.

show abstract

Distributed optimization over time-varying directed graphs

Nedić

Olshevsky

2013

320

737

View full text Add to dashboard Cite

We consider distributed optimization by a collection of nodes, each having access to its own convex function, whose collective goal is to minimize the sum of the functions. The communications between nodes are described by a timevarying sequence of directed graphs, which is uniformly strongly connected. For such communications, assuming that every node knows its out-degree, we develop a broadcast-based algorithm, termed the subgradient-push, which steers every node to an optimal value under a standard assumption of subgradient boundedness. The subgradient-push requires no knowledge of either the number of agents or the graph sequence to implement. Our analysis shows that the subgradient-push algorithm converges at a rate of O ln t/ √ t , where the constant depends on the initial values at the nodes, the subgradient norms, and, more interestingly, on both the consensus speed and the imbalances of influence among the nodes. 52nd IEEE Conference on Decision and Control December 10-13, 2013. Florence, Italy 978-1-4673-5717-3/13/$31.00 ©2013 IEEE 6855

show abstract

Convergence Speed in Distributed Consensus and Averaging

Olshevsky¹,

Tsitsiklis²

2009

SIAM J. Control Optim.

516

388

View full text Add to dashboard Cite

Abstract. We study the convergence speed of distributed iterative algorithms for the consensus and averaging problems, with emphasis on the latter. We first consider the case of a fixed communication topology. We show that a simple adaptation of a consensus algorithm leads to an averaging algorithm. We prove lower bounds on the worst-case convergence time for various classes of linear, time-invariant, distributed consensus methods, and provide an algorithm that essentially matches those lower bounds. We then consider the case of a time-varying topology, and provide a polynomial-time averaging algorithm.

show abstract

Network Topology and Communication-Computation Tradeoffs in Decentralized Optimization

Nedić

Olshevsky

Rabbat³

2018

Proc. IEEE

414

329

View full text Add to dashboard Cite

In decentralized optimization, nodes cooperate to minimize an overall objective function that is the sum (or average) of per-node private objective functions. Algorithms interleave local computations with communication among all or a subset of the nodes. Motivated by a variety of applications-decentralized estimation in sensor networks, fitting models to massive data sets, and decentralized control of multi-robot systems, to name a fewsignificant advances have been made towards the development of robust, practical algorithms with theoretical performance guarantees. This paper presents an overview of recent work in this area. In general, rates of convergence depend not only on the number of nodes involved and the desired level of accuracy, but also on the structure and nature of the network over which nodes communicate (e.g., whether links are directed or undirected, static or time-varying). We survey the state-of-the-art algorithms and their analyses tailored to these different scenarios, highlighting the role of the network topology.A. Nedić is with the

show abstract

Federated learning of predictive models from federated Electronic Health Records

Brisimi

Chen

Mela

et al. 2018

International Journal of Medical Informatics

614

318

View full text Add to dashboard Cite

We test cPDS on the problem of predicting hospitalizations due to heart diseases within a calendar year based on information in the patients Electronic Health Records prior to that year. cPDS converges faster than centralized methods at the cost of some communication between agents. It also converges faster and with less communication overhead compared to an alternative distributed algorithm. In both cases, it achieves similar prediction accuracy measured by the Area Under the Receiver Operating Characteristic Curve (AUC) of the classifier. We extract important features discovered by the algorithm that are predictive of future hospitalizations, thus providing a way to interpret the classification results and inform prevention efforts.

show abstract

Fast Convergence Rates for Distributed Non-Bayesian Learning

Nedić

Olshevsky

Uribe

2017

IEEE Trans. Automat. Contr.

182

242

View full text Add to dashboard Cite

We consider the problem of distributed learning, where a network of agents collectively aim to agree on a hypothesis that best explains a set of distributed observations of conditionally independent random processes. We propose a distributed algorithm and establish consistency, as well as a non-asymptotic, explicit and geometric convergence rate for the concentration of the beliefs around the set of optimal hypotheses. Additionally, if the agents interact over static networks, we provide an improved learning protocol with better scalability with respect to the number of nodes in the network.

show abstract

Stochastic Gradient-Push for Strongly Convex Functions on Time-Varying Directed Graphs

Nedić

Olshevsky

2016

IEEE Trans. Automat. Contr.

267

198

View full text Add to dashboard Cite

We investigate the convergence rate of the recently proposed subgradient-push method for distributed optimization over time-varying directed graphs. The subgradient-push method can be implemented in a distributed way without requiring knowledge of either the number of agents or the graph sequence; each node is only required to know its out-degree at each time. Our main result is a convergence rate of O ((ln t)/t) for strongly convex functions with Lipschitz gradients even if only stochastic gradient samples are available; this is asymptotically faster than the O (ln t)/ √ t rate previously known for (general) convex functions.

show abstract

Convergence Speed in Distributed Consensus and Averaging

Olshevsky¹,

Tsitsiklis²

2011

SIAM Rev.

195

178

View full text Add to dashboard Cite

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Alex Olshevsky

Achieving Geometric Convergence for Distributed Optimization Over Time-Varying Graphs

Distributed optimization over time-varying directed graphs

Convergence Speed in Distributed Consensus and Averaging

Network Topology and Communication-Computation Tradeoffs in Decentralized Optimization

Federated learning of predictive models from federated Electronic Health Records

Fast Convergence Rates for Distributed Non-Bayesian Learning

Stochastic Gradient-Push for Strongly Convex Functions on Time-Varying Directed Graphs

Convergence Speed in Distributed Consensus and Averaging

Contact Info

Product

Resources

About