2019
DOI: 10.48550/arxiv.1911.08510
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Optimal Complexity and Certification of Bregman First-Order Methods

Radu-Alexandru Dragomir,
Adrien Taylor,
Alexandre d'Aspremont
et al.

Abstract: We provide a lower bound showing that the O(1/k) convergence rate of the NoLips method (a.k.a. Bregman Gradient or Mirror Descent) is optimal for the class of functions satisfying the h-smoothness assumption. This assumption, also known as relative smoothness, appeared in the recent developments around the Bregman Gradient method, where acceleration remained an open issue.On the way, we show how to constructively obtain the corresponding worst-case functions by extending the computer-assisted performance estim… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
8
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
6
1

Relationship

1
6

Authors

Journals

citations
Cited by 7 publications
(11 citation statements)
references
References 32 publications
(96 reference statements)
1
8
0
Order By: Relevance
“…Firstly, in terms of the dependence on β and µ our algorithm achieves the lower bound (13) obtained in [31]. Secondly, we compare our bound with the bound ( 14) of the DISCO algorithm [32], which unlike other works [23], [24], [25], [26], [27], [28], [29], [30] also achieves the lower bound in terms of the dependence on β and µ. Since the dependence on these parameters in (14) and in our bound (34) are the same, we compare the other parts of the complexity bound.…”
Section: Achieving the Lower Bound For Finite-sum Optimization Under ...mentioning
confidence: 64%
See 1 more Smart Citation
“…Firstly, in terms of the dependence on β and µ our algorithm achieves the lower bound (13) obtained in [31]. Secondly, we compare our bound with the bound ( 14) of the DISCO algorithm [32], which unlike other works [23], [24], [25], [26], [27], [28], [29], [30] also achieves the lower bound in terms of the dependence on β and µ. Since the dependence on these parameters in (14) and in our bound (34) are the same, we compare the other parts of the complexity bound.…”
Section: Achieving the Lower Bound For Finite-sum Optimization Under ...mentioning
confidence: 64%
“…This idea have been recently extensively exploited for optimization problems (mainly) over master/workers architectures, under the name of statistical preconditioning [23], [24], [25], [26], [27], [28], [29], [30]. These papers focus on solving the finite-sum problem (7) and most of them do not achieve the lower communication complexity bound for this setting obtained in [31]…”
Section: Introductionmentioning
confidence: 99%
“…An obvious one is incorporating acceleration to improve communication complexity bounds. The results of (Hendrikx et al, 2020a) and (Dragomir et al, 2019) imply that the performance of first-order methods is limited even for centralized architectures. Secondly, other ERM algorithms should be evaluated through the lens of main target being statistical precision.…”
Section: Discussionmentioning
confidence: 99%
“…A direct acceleration of the mirror method, achieving O(β/µ) over star-networks [Lu et al, 2020], does not seem possible in general [Dragomir et al, 2019].…”
Section: Contributionsmentioning
confidence: 99%
“…For quadratic losses, DANE achieves communication complexity O((β/µ) 2 log 1/ε). More recently, [Fan et al, 2019] proposed CEASE, which achieves DANE's complexity but for nonquadratic losses and r = 0. Applying the convergence analysis of mirror descent in [Lu et al, 2020] to CEASE enhances its rate to O((β/µ) log 1/ε).…”
Section: Related Workmentioning
confidence: 99%