2017
DOI: 10.48550/arxiv.1710.11606
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Lower Bounds for Finding Stationary Points I

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

3
84
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
5
4

Relationship

1
8

Authors

Journals

citations
Cited by 28 publications
(87 citation statements)
references
References 0 publications
3
84
0
Order By: Relevance
“…Optimal rates for finding second-order stationary points. Carmon et al [2017a] have presented lower bounds that imply that GD achieves the optimal rate for finding stationary points for gradient Lipschitz functions. In our setting, we additionally assume that the Hessian is Lipschitz.…”
Section: Discussionmentioning
confidence: 99%
“…Optimal rates for finding second-order stationary points. Carmon et al [2017a] have presented lower bounds that imply that GD achieves the optimal rate for finding stationary points for gradient Lipschitz functions. In our setting, we additionally assume that the Hessian is Lipschitz.…”
Section: Discussionmentioning
confidence: 99%
“…The iteration complexity order O(ε −2 log(ε −1 )) in Theorem 3.4 is tight (up to logarithmic factors). This is due to the fact that for general non-convex smooth problems, finding an ε-stationary solution requires at least Ω(ε −2 ) gradient evaluations [7,47]. Clearly, this lower bound is also valid for finding an ε-FNE of PL-games.…”
Section: Convergence Analysis Of Multi-step Gradient Descent Ascent A...mentioning
confidence: 99%
“…We consider intermittent communication algorithms, which attempt to optimize F using M parallel workers, each of which is allowed K queries to g in each of R rounds of communication. Such intermittent communication algorithms can be formalized using the graph oracle framework of Woodworth et al (2018) which focuses on the dependence structure between different stochastic gradient computations, and to facilitate our lower bounds, we follow Carmon et al (2017) and focus our attention on distributed zero-respecting algorithms:…”
Section: Setting and Notationmentioning
confidence: 99%
“…Furthermore, work in other contexts has succeeded in proving that the min-max complexity for arbitrary randomized algorithms often matches the min-max complexity for zero-respecting algorithms, but at the expense of much more complicated proofs (e.g. Woodworth and Srebro, 2016;Carmon et al, 2017;Arjevani et al, 2020).…”
Section: Setting and Notationmentioning
confidence: 99%