Accelerated Methods for NonConvex Optimization

Carmon, Yair; Duchi, John C.; Hinder, Oliver; Sidford, Aaron

doi:10.1137/17m1114296

Cited by 191 publications

(266 citation statements)

References 25 publications

Supporting

Mentioning

257

Contrasting

Unclassified

Order By: Relevance

“…Идея за-ключается в том, чтобы формировать матрицу Гессе оптимизируемой функции исходя из матриц Гессе относительно небольшого числа случайно выбранных слагаемых [Ghadimi et al, 2017]. Другая идея заключается в отказе от обращения матрицы Гессе на итерации, вместо этого пред-лагается использовать информацию о собственном векторе, отвечающем наименьшему собствен-ному значению [Agarwal et al, 2017;Carmon et al, 2017]. Для приближенного вычисления тако-го вектора вполне достаточно уметь умножать матрицу Гессе на произвольный вектор:…”

Section: Discussionunclassified

A hypothesis about the rate of global convergence for optimal methods (Newtons type) in smooth convex optimization

Gasnikov¹,

Kovalev²

2018

CRM

View full text Add to dashboard Cite

Section: Discussionunclassified

A hypothesis about the rate of global convergence for optimal methods (Newtons type) in smooth convex optimization

Gasnikov¹,

Kovalev²

2018

CRM

View full text Add to dashboard Cite

“…Carmon et al (Carmon et al, 2018) used negative curvature directions at the first phase of iterates and then switched it to accelerated stochastic gradient descent method when an iterate reaches an almost convex region.…”

Section: Related Workmentioning

confidence: 99%

Combining Stochastic Adaptive Cubic Regularization with Negative Curvature for Nonconvex Optimization

Park

Jung

Pardalos

2019

J Optim Theory Appl

View full text Add to dashboard Cite

We focus on minimizing nonconvex finite-sum functions that typically arise in machine learning problems. In an attempt to solve this problem, the adaptive cubic regularized Newton method has shown its strong global convergence guarantees and ability to escape from strict saddle points. This method uses a trust region-like scheme to determine if an iteration is successful or not, and updates only when it is successful.In this paper, we suggest an algorithm combining negative curvature with the adaptive cubic regularized Newton method to update even at unsuccessful iterations. We call this new method Stochastic Adaptive cubic regularization with Negative Curvature (SANC). Unlike the previous method, in order to attain stochastic gradient and Hessian estimators, the SANC algorithm uses independent sets of data points of consistent size over all iterations. It makes the SANC algorithm more practical to apply for solving large-scale machine learning problems. To the best of our knowledge, this is the first approach that combines the negative curvature method with the adaptive cubic regularized Newton method. Finally, we provide experimental results including neural networks problems supporting the efficiency of our method.

show abstract

“…In particular, accelerated inexact proximal points (AIPP) methods, i.e. methods which use an accelerated composite gradient variant to approximately solve the generated sequence of prox subproblems, have been developed for it (see, for example, [2,5]). When Y is not a singleton, (1) can no longer be directly solved by an AIPP method due to the nonsmoothness of the max term.…”

Section: Introductionmentioning

confidence: 99%

“…x − x ≤ δ (2) in at most O(δ −3 ) gradient and proximal subproblem evaluations. Next, we develop an AIPP-S variant that computes an approximate solution involving a saddle-point formulation of (1).…”

Section: Introductionmentioning

confidence: 99%

“…However, their method does not solve the perturbed problem using an accelerated method unlike the approach taken in this paper. Each of the methods in [9,10] consider notions of approximate solutions that are different from (2) and (3), making a comparison between these methods and the one presented in this paper not straightforward. We instead defer this discussion to Section 5 where it is shown that the AIPP-S method is more efficient when a common termination criterion is used.…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

An efficient adaptive accelerated inexact proximal point method for solving linearly constrained nonconvex composite problems

2020

View full text Add to dashboard Cite

This paper presents a quadratic-penalty type method for solving linearly-constrained composite nonconvex-concave min-max problems. The method consists of solving a sequence of penalty subproblems which, due to the min-max structure of the problem, are potentially nonsmooth but can be approximated by smooth composite nonconvex minimization problems. Each of these penalty subproblems is then solved by applying an accelerated inexact proximal point method to its corresponding smooth composite nonconvex approximation. Iteration complexity bounds for obtaining approximate stationary points of the linearly-constrained composite nonconvex-concave min-max problem are also established.

show abstract

Accelerated Methods for NonConvex Optimization

Cited by 191 publications

References 25 publications

A hypothesis about the rate of global convergence for optimal methods (Newtons type) in smooth convex optimization

A hypothesis about the rate of global convergence for optimal methods (Newtons type) in smooth convex optimization

Combining Stochastic Adaptive Cubic Regularization with Negative Curvature for Nonconvex Optimization

An efficient adaptive accelerated inexact proximal point method for solving linearly constrained nonconvex composite problems

Contact Info

Product

Resources

About