2018
DOI: 10.1007/s10107-018-1328-7
A decoupled first/second-order steps technique for nonconvex nonlinear unconstrained optimization with improved complexity bounds

Abstract: In order to be provably convergent towards a second-order stationary point, optimization methods applied to nonconvex problems must necessarily exploit both first and second-order information. However, as revealed by recent complexity analyses of some of these methods, the overall effort to reach second-order points is significantly larger when compared to that of approaching first-order ones. On the other hand, there are other algorithmic schemes, initially designed with first-order convergence in mind, th…
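The abstract above is truncated, but the core idea, treating the cheap first-order and the expensive second-order criticality measures with separate tolerances and separate steps, can be illustrated with a deliberately generic sketch. The code below is only an assumption-laden toy: the gradient-descent/negative-curvature alternation, the fixed step size alpha, and the names decoupled_steps and smallest_eig are illustrative and are not the paper's algorithm.

```python
import numpy as np

def smallest_eig(H):
    """Return the smallest eigenvalue of H and a corresponding unit eigenvector."""
    w, V = np.linalg.eigh(H)
    return w[0], V[:, 0]

def decoupled_steps(grad, hess, x0, eps_g=1e-5, eps_H=1e-3,
                    alpha=1e-2, max_iter=10_000):
    """Take cheap gradient steps while the gradient is large; query the
    Hessian (the expensive, second-order information) only once the
    first-order measure is already below its own tolerance eps_g."""
    x = np.asarray(x0, dtype=float)
    for _ in range(max_iter):
        g = grad(x)
        if np.linalg.norm(g) > eps_g:
            x = x - alpha * g           # first-order (gradient) step
            continue
        lam, v = smallest_eig(hess(x))  # second-order information
        if lam >= -eps_H:
            return x                    # approximate second-order point
        # Negative-curvature step, oriented so it does not go uphill
        # along the gradient.
        d = -v if g @ v > 0 else v
        x = x + alpha * d
    return x
```

On a strict-saddle toy problem such as f(x_1, x_2) = x_1^2 + x_2^4/4 − x_2^2/2, a run started at (1, 0) first drives the gradient norm below eps_g near the saddle at the origin, and only then pays for an eigenvalue computation to escape along the negative-curvature direction; the two tolerances eps_g and eps_H can be tightened independently.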

Cited by 15 publications (14 citation statements)
References 25 publications
“…Specifically, if χ_{f,1}(x_k) > ε_1, one might simply require that χ_{m,1}(x_k, s_k, σ_k) ≤ θ‖s_k‖^p rather than (2.20), as this alone would aim to improve first-order criticality. However, though this decoupling is possible both in practice and in the analysis, it is not as straightforward as in the case of, say, trust-region methods [12], as the lower bounds on the step in (3.3) and (3.4) depend on the objective's gradient and Hessian value at the next trial point/iterate, not the current x_k. Also, one might modify the ARp algorithm to check the optimality measures (2.18) at every trial point, not just successful ones.…”
Section: Final Comments (mentioning)
confidence: 99%
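The decoupled inner stopping test described in the statement above can be made concrete with a small sketch. Everything below is hypothetical glue around the quantities named in the quote (chi_f1 stands for χ_{f,1}(x_k), chi_m1 and chi_m2 for first- and second-order model criticality measures at the trial step s, theta and p as in an ARp-type method); it is not the cited paper's code.

```python
import numpy as np

def inner_stop(chi_f1, chi_m1, chi_m2, s, eps_1, theta=0.5, p=3):
    """Decide whether to stop the ARp subproblem solve at trial step s.

    While the outer first-order measure chi_f1 still exceeds eps_1, only
    first-order model criticality is required (the decoupled test); the
    second-order model condition is enforced only afterwards.
    """
    tol = theta * np.linalg.norm(s) ** p
    if chi_f1 > eps_1:
        return chi_m1 <= tol
    return chi_m1 <= tol and chi_m2 <= tol
```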
“…The difference can be explained by the restriction enforced by the trust-region constraint on the norm of the steps. Recent work has shown that it is possible to improve the bound for trust-region algorithms using specific definitions of the trust-region radius [13]. The best known iteration bound for a second-order algorithm (that is, an algorithm relying on the use of second-order derivatives and Newton-type steps) is O(max{ε_g^{-3/2}, ε_H^{-3}}).…”
mentioning
confidence: 99%
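To see which tolerance drives the O(max{ε_g^{-3/2}, ε_H^{-3}}) bound quoted above, a quick evaluation for a few tolerance pairs is enough; the values below are purely illustrative and not taken from the cited papers.

```python
# Scaling of the second-order iteration bound O(max{eps_g**-1.5, eps_H**-3})
# quoted above; the tolerance values below are arbitrary illustrations.
for eps_g, eps_H in [(1e-6, 1e-2), (1e-6, 1e-4), (1e-4, 1e-4)]:
    bound = max(eps_g ** -1.5, eps_H ** -3)
    driver = "gradient" if eps_g ** -1.5 >= eps_H ** -3 else "Hessian"
    print(f"eps_g={eps_g:.0e}, eps_H={eps_H:.0e}: "
          f"bound ~ {bound:.1e} ({driver} term dominates)")
```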
“…In more recent work, Garmanjani, Júdice and Vicente (2016) provide a WCC bound of the form (2.2) for Algorithm 3, recovering essentially the same upper bound on the number of function evaluations required by DDS methods found in Vicente (2013), that is, a WCC bound in (see Table A.1). When , Gratton, Royer and Vicente (2019a) demonstrate a second-order WCC bound of the form (2.3) in ; in order to achieve this result, fully quadratic models are required. In Section 3.3, a similar result is achieved by using randomized variants that do not require a fully quadratic model in every iteration.…”
Section: Deterministic Methods For Deterministic Objectives (mentioning)
confidence: 99%
“…w.p. 1 − p_1 | DDS [Gratton et al., 2015]: C m n L_g^2 (f(x_0) − f(x_*)) ε^{-2} | ‖∇f(x_k)‖ ≤ ε w.p. 1 − p_2 | TR [Gratton et al., 2018]: C,D m max{κ_ef, κ_eg}^2 (f(x_0) − f(x_*)) ε^{-2} | f ∈ LC^2 | max{‖∇f(x_k)‖, −λ_k} ≤ ε | DDS [Gratton et al., 2016]: n^5 max{L_H, L_g}^3 (f(x_0) − f(x_*)) ε^{-3} | TR [Gratton et al., 2019a]: n^5 max{L_H^3, L_g^2} (f(x_0) − f(x_*)) ε^{-3} | max{‖∇f(x_k)‖, −λ_k} ≤ ε w.p. 1 − p_3 | TR [Gratton et al., 2018]: C,D m max{κ_eg, κ_eH}^3 (f(x_0) − f(x_*))…”
mentioning
confidence: 99%