Abstract: We consider a family of algorithms that successively sample and minimize simple stochastic models of the objective function. We show that under reasonable conditions on approximation quality and regularity of the models, any such algorithm drives a natural stationarity measure to zero at the rate O(k^{-1/4}). As a consequence, we obtain the first complexity guarantees for the stochastic proximal point, proximal subgradient, and regularized Gauss-Newton methods for minimizing compositions of convex functions wit…
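The model-based template this abstract describes can be illustrated with a minimal sketch of one of the named members, the stochastic proximal point method, on a least-squares objective. The data, stepsize schedule, and function names here are hypothetical; the closed-form prox step is specific to the scalar quadratic model f(x; a, b) = ½(aᵀx − b)².

```python
import numpy as np

def stochastic_prox_point(A, b, x0, alpha0=1.0, iters=2000, seed=0):
    """Stochastic proximal point for f(x) = (1/2m) * sum_i (a_i^T x - b_i)^2.

    Each iteration exactly minimizes the sampled model
        (1/2)(a_i^T y - b_i)^2 + (1/(2*alpha_k)) * ||y - x_k||^2,
    which has a closed form for a single scalar least-squares term.
    """
    rng = np.random.default_rng(seed)
    x = np.array(x0, dtype=float)
    m = len(b)
    for k in range(iters):
        i = rng.integers(m)
        a, bi = A[i], b[i]
        alpha = alpha0 / np.sqrt(k + 1)   # diminishing stepsize schedule
        # Exact minimizer of the sampled model plus proximal term:
        x = x - alpha * (a @ x - bi) / (1.0 + alpha * (a @ a)) * a
    return x

# Toy usage on a consistent linear system (hypothetical data):
rng = np.random.default_rng(1)
A = rng.normal(size=(200, 5))
x_star = rng.normal(size=5)
b = A @ x_star
x_hat = stochastic_prox_point(A, b, np.zeros(5))
print(np.linalg.norm(x_hat - x_star))  # small: the data are noiseless
```

Because each step solves the sampled model exactly rather than linearizing it, the iteration never overshoots on a single sample, which is the stability property the surrounding papers emphasize.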
“…An essential step in the analysis of stochastic recursive algorithms by the differential inclusion method is the chain rule on a path (see [9] and the references therein). For an absolutely continuous function p : [0, ∞) → ℝⁿ we denote by ṗ(·) its weak derivative: a measurable function such that…”
Section: Generalized Subdifferentials Of Composite Functions
We propose a single time-scale stochastic subgradient method for constrained optimization of a composition of several nonsmooth and nonconvex functions. The functions are assumed to be locally Lipschitz and differentiable in a generalized sense. Only stochastic estimates of the values and generalized derivatives of the functions are used. The method is parameter-free. We prove convergence with probability one of the method, by associating with it a system of differential inclusions and devising a nondifferentiable Lyapunov function for this system. For problems with functions having Lipschitz continuous derivatives, the method finds a point satisfying an optimality measure with error of order 1/√N, after executing N iterations with constant stepsize.
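The key computational ingredient in such composition methods is the chain-rule estimate of a generalized gradient: a subgradient of the outer function multiplied by the Jacobian of the inner smooth map. The following hedged sketch (not the paper's exact algorithm) applies that estimate on the phase-retrieval objective f(x) = E|⟨a, x⟩² − b|; the data, warm start, and stepsize schedule are all hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)
n, m = 5, 100
A = rng.normal(size=(m, n))
x_star = rng.normal(size=n)
x_star /= np.linalg.norm(x_star)
b = (A @ x_star) ** 2                    # noiseless phase-retrieval data

def full_obj(x):
    return np.mean(np.abs((A @ x) ** 2 - b))

# Chain-rule subgradient of f(x; a, b) = |(a^T x)^2 - b|:
# the outer |.| contributes sign(r); the inner smooth map contributes 2(a^T x)a.
x = x_star + 0.1 * rng.normal(size=n)    # warm start near the target
f0 = full_obj(x)
for k in range(5000):
    i = rng.integers(m)
    a = A[i]
    r = (a @ x) ** 2 - b[i]
    g = np.sign(r) * 2.0 * (a @ x) * a   # stochastic generalized gradient
    x -= 0.01 / np.sqrt(k + 1) * g       # diminishing stepsize
f_final = full_obj(x)
print(f0, "->", f_final)
```

Only function values and generalized-derivative estimates of the sampled term are used, mirroring the information model of the abstract above.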
“…By providing a "relative" noise condition on f , Assumption A4 allows for a broader class of functions without global Lipschitz properties (as are typically assumed [8]), such as the phase retrieval and matrix completion objectives (Examples 1 and 2). It can allow exponential growth, addressing the challenges in Ex.…”
Section: Stability and Its Consequences For Weakly Convex Functions
“…To describe convergence and stability guarantees in non-convex (even non-smooth) settings, we require appropriate definitions. Finding global minima of non-convex functions is computationally infeasible [26], so we follow established practice and consider convergence to stationary points, specifically using the convergence of the Moreau envelope [8,13]. To formalize, for x ∈ ℝⁿ and λ ≥ 0, the Moreau envelope and associated proximal map are…”
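The snippet breaks off before the display; the standard definitions, consistent with the notation F_λ and prox_{F/λ} used in the surrounding excerpts, are:

```latex
F_{\lambda}(x) \;:=\; \inf_{y \in \mathbb{R}^n} \Big\{ F(y) + \tfrac{\lambda}{2}\,\|y - x\|_2^2 \Big\},
\qquad
\mathrm{prox}_{F/\lambda}(x) \;:=\; \operatorname*{argmin}_{y \in \mathbb{R}^n} \Big\{ F(y) + \tfrac{\lambda}{2}\,\|y - x\|_2^2 \Big\}.
```

For a ρ-weakly convex F the inner problem is strongly convex whenever λ > ρ, which is why the minimizer is unique "for large enough λ" in the excerpt that follows.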
Section: Stability and Its Consequences For Weakly Convex Functions
“…For large enough λ, the minimizer x_λ := prox_{F/λ}(x) is unique whenever F is weakly convex. Adopting the techniques pioneered by Davis and Drusvyatskiy [8] for convergence of stochastic methods on weakly convex problems, our convergence machinery relies on the Moreau envelope's connections to (near) stationarity:…”
Section: Stability and Its Consequences For Weakly Convex Functions
“…The three properties (8) imply that any nearly stationary point x of F_λ (when ‖∇F_λ(x)‖₂ is small) is close to a nearly stationary point x_λ of the original function F(·). To prove convergence for weakly convex methods, then, it is sufficient to show that ∇F_λ(x_k) → 0.…”
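The identity underlying this argument is ∇F_λ(x) = λ(x − x_λ), with x_λ = prox_{F/λ}(x). A quick numerical check on F(y) = |y|, whose proximal map is soft-thresholding and whose envelope is the Huber function, confirms the identity against a finite difference; the parameter value is arbitrary.

```python
import numpy as np

lam = 2.0  # envelope parameter; any lam > 0 works since F is convex here

def prox(x, lam):
    """prox_{F/lam}(x) for F(y) = |y|: soft-thresholding at 1/lam."""
    return np.sign(x) * np.maximum(np.abs(x) - 1.0 / lam, 0.0)

def envelope(x, lam):
    """Moreau envelope F_lam(x) = min_y |y| + (lam/2)(y - x)^2 (Huber)."""
    y = prox(x, lam)
    return np.abs(y) + 0.5 * lam * (y - x) ** 2

xs = np.linspace(-3, 3, 7)
# Identity: grad F_lam(x) = lam * (x - prox_{F/lam}(x))
grad_identity = lam * (xs - prox(xs, lam))
# Compare with a centered finite difference of the envelope
h = 1e-6
grad_fd = (envelope(xs + h, lam) - envelope(xs - h, lam)) / (2 * h)
print(np.max(np.abs(grad_identity - grad_fd)))  # roundoff-level, well below 1e-6
```

The identity also explains the quoted proof strategy: driving ∇F_λ(x_k) to zero is the same as driving x_k toward its own proximal point x_λ.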
Section: Stability and Its Consequences For Weakly Convex Functions
Standard stochastic optimization methods are brittle, sensitive to stepsize choices and other algorithmic parameters, and they exhibit instability outside of well-behaved families of objectives. To address these challenges, we investigate models for stochastic optimization and learning problems that exhibit better robustness to problem families and algorithmic parameters. With appropriately accurate models (which we call the aProx family [2]), stochastic methods can be made stable, provably convergent, and asymptotically optimal; even modeling that the objective is nonnegative is sufficient for this stability. We extend these results beyond convexity to weakly convex objectives, which include compositions of convex losses with smooth functions common in modern machine learning applications. We highlight the importance of robustness and accurate modeling with a careful experimental evaluation of convergence time and algorithm sensitivity.
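The "modeling that the objective is nonnegative" idea admits a short sketch. Assuming the truncated-model update x⁺ = x − min(α, f(x;ξ)/‖g‖²)·g, which results from minimizing the model max(f(x;ξ) + ⟨g, y − x⟩, 0) plus a proximal term, the following hypothetical example runs it on absolute-loss regression with consistent data; the stepsize cap α and data are illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.normal(size=(200, 5))
x_star = rng.normal(size=5)
b = A @ x_star                       # consistent data, so min_x f(x) = 0
x = np.zeros(5)
alpha = 1.0                          # stepsize cap; the method is insensitive to it
for k in range(3000):
    i = rng.integers(200)
    a, r = A[i], A[i] @ x - b[i]
    if r == 0.0:
        continue                     # sampled model already at its lower bound
    g = np.sign(r) * a               # subgradient of f_i(x) = |a_i^T x - b_i|
    eta = min(alpha, abs(r) / (g @ g))   # truncated-model stepsize
    x -= eta * g
print(np.linalg.norm(x - x_star))
```

The truncation caps the step so a single sample can never drive the model below its known lower bound of zero, which is precisely the stability mechanism the abstract credits for robustness to the stepsize choice.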
This paper reviews the gradient sampling methodology for solving nonsmooth, nonconvex optimization problems. An intuitively straightforward gradient sampling algorithm is stated and its convergence properties are summarized. Throughout this discussion, we emphasize the simplicity of gradient sampling as an extension of the steepest descent method for minimizing smooth objectives. We then provide overviews of various enhancements that have been proposed to improve practical performance, as well as of several extensions that have been made in the literature, such as to solve constrained problems. The paper also includes clarification of certain technical aspects of the analysis of gradient sampling algorithms, most notably related to the assumptions one needs to make about the set of points at which the objective is continuously differentiable. Finally, we discuss possible future research directions.
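A bare-bones sketch of the gradient sampling iteration summarized above: sample gradients in a small neighborhood, step along the negative minimum-norm element of their convex hull, and shrink the sampling radius when that element certifies approximate stationarity. The minimum-norm subproblem is approximated here with a simple Frank-Wolfe routine; the objective, sampling radius, and line-search constants are illustrative, not taken from the reviewed paper.

```python
import numpy as np

def min_norm_in_hull(G, steps=200):
    """Frank-Wolfe approximation of the min-norm element of conv{rows of G}."""
    d = G[0].copy()
    for _ in range(steps):
        g = G[np.argmin(G @ d)]          # vertex most opposed to d
        diff = d - g
        denom = diff @ diff
        if denom < 1e-16:
            break
        gamma = np.clip((diff @ d) / denom, 0.0, 1.0)  # exact line search
        d = (1 - gamma) * d + gamma * g
    return d

def gradient_sampling(f, grad, x0, eps=0.5, nu=0.1, iters=60, m=15, seed=0):
    """Sample gradients near x (in a box of half-width eps for simplicity),
    descend along the negative min-norm hull element, and shrink eps when
    the hull certifies approximate stationarity."""
    rng = np.random.default_rng(seed)
    x = np.asarray(x0, dtype=float)
    for _ in range(iters):
        pts = x + eps * rng.uniform(-1, 1, size=(m, x.size))
        G = np.array([grad(p) for p in pts])
        d = min_norm_in_hull(G)
        if np.linalg.norm(d) <= nu:
            eps *= 0.5                   # approximately stationary: refine
            continue
        t = 1.0                          # backtracking (Armijo) line search
        while t > 1e-10 and f(x - t * d) > f(x) - 1e-4 * t * (d @ d):
            t *= 0.5
        x = x - t * d
    return x

# Toy nonsmooth objective, differentiable almost everywhere: f(x) = ||x||_1
f = lambda x: np.abs(x).sum()
grad = lambda x: np.sign(x)
x_final = gradient_sampling(f, grad, x0=[2.0, 1.3])
print(x_final)  # approaches the minimizer at the origin
```

The sketch mirrors the paper's framing of gradient sampling as a robustified steepest descent: with a single sample it reduces to a gradient step, and the hull computation only matters near points of nondifferentiability.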