2016
DOI: 10.1080/10556788.2016.1138222

A second-order method for convex ℓ1-regularized optimization with active-set prediction

Abstract: We describe an active-set method for the minimization of an objective function φ that is the sum of a smooth convex function f and an ℓ1-regularization term. A distinctive feature of the method is the way in which active-set identification and second-order subspace minimization steps are integrated to combine the predictive power of the two approaches. At every iteration, the algorithm selects a candidate set of free and fixed variables, performs an (inexact) subspace phase, and then assesses the quality of th…

Cited by 24 publications (17 citation statements). References 31 publications (64 reference statements).
“…Since U ∈ ℜ^{n×n} is a diagonal matrix, at first glance the costs of computing AUA^T and the matrix-vector multiplication AUA^T d for a given vector d ∈ ℜ^m are O(m^2 n) and O(mn), respectively. These costs are prohibitive when the dimensions of A are large and can render commonly employed approaches, such as the Cholesky factorization and the conjugate gradient method, unsuitable for solving (26). Fortunately, in the sparse optimization setting, if the sparsity of U is exploited carefully, these unfavorable costs can be reduced to a level at which they are negligible, or at least insignificant, compared with the other costs.…”
Section: An Efficient Implementation of SSN for Solving Subproblems (18)
Citation type: mentioning; confidence: 99%
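The cost reduction described in this excerpt can be illustrated with a short sketch: when the diagonal matrix U has only r nonzero entries, the product AUA^T d touches only the corresponding r columns of A. The NumPy snippet below is a minimal illustration of that idea under those assumptions, not the cited implementation; all names are chosen for the example.

```python
# Minimal sketch: exploit the sparsity of the diagonal matrix U when forming
# A U A^T d. Only the columns of A indexed by the support of U contribute,
# so the matrix-vector product costs O(mr) instead of O(mn).
import numpy as np

def sparse_diag_matvec(A, u, d):
    """Compute A @ diag(u) @ A.T @ d using only the nonzero entries of u."""
    J = np.flatnonzero(u)                 # support of the diagonal, |J| = r
    A_J = A[:, J]                         # m x r submatrix of A
    return A_J @ (u[J] * (A_J.T @ d))

# Small consistency check against the dense computation.
rng = np.random.default_rng(0)
m, n, r = 200, 5000, 30
A = rng.standard_normal((m, n))
u = np.zeros(n)
u[rng.choice(n, size=r, replace=False)] = 1.0
d = rng.standard_normal(m)
assert np.allclose(sparse_diag_matvec(A, u, d), A @ (u * (A.T @ d)))
```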
“…See Figure 2 for an illustration of the computation of A_J^T A_J. In this case, the total computational cost of solving the Newton linear system (26) is reduced significantly further, from O(m^2(m + r)) to O(r^2(m + r)). We emphasize that this dramatic reduction in computational cost results from combining a careful examination of the second-order sparsity present in Lasso-type problems with some “smart” numerical linear algebra.…”
Section: An Efficient Implementation of SSN for Solving Subproblems (18)
Citation type: mentioning; confidence: 99%
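One standard way to realize a reduction of this kind is the Sherman-Morrison-Woodbury identity: if the Newton system has the form (I_m + A_J A_J^T) x = b with |J| = r much smaller than m (an assumed structure used for illustration, not taken from the cited paper), only an r x r matrix needs to be factorized. A hedged sketch:

```python
# Hedged sketch of solving (I_m + A_J A_J^T) x = b with an r x r factorization,
# using the Sherman-Morrison-Woodbury identity:
#   (I + A_J A_J^T)^{-1} b = b - A_J (I_r + A_J^T A_J)^{-1} A_J^T b.
# Forming I_r + A_J^T A_J costs O(r^2 m) and its Cholesky factor O(r^3),
# in line with the O(r^2 (m + r)) figure quoted above.
import numpy as np

def solve_reduced_newton(A_J, b):
    r = A_J.shape[1]
    G = np.eye(r) + A_J.T @ A_J           # small r x r system
    L = np.linalg.cholesky(G)
    t = np.linalg.solve(L.T, np.linalg.solve(L, A_J.T @ b))
    return b - A_J @ t

rng = np.random.default_rng(1)
m, r = 1000, 20
A_J = rng.standard_normal((m, r))
b = rng.standard_normal(m)
x = solve_reduced_newton(A_J, b)
assert np.allclose((np.eye(m) + A_J @ A_J.T) @ x, b)
```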
“…Active-set strategies have also been studied in [65,107]. Specifically, the method in [65] solves a smooth quadratic subproblem determined by the active sets and invokes a corrective cycle that greatly improves the efficiency and robustness of the algorithm. The method is globalized by using a proximal gradient step to check that the desired progress is made.…”
Section: Active Set Methods
Citation type: mentioning; confidence: 99%
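To make the mechanism concrete, the sketch below shows one highly simplified active-set iteration for min_x 0.5*||Ax - b||^2 + lam*||x||_1: a proximal-gradient (soft-thresholding) step predicts the free variables, and a second-order step is then taken on that subspace. This is only an illustrative sketch under those assumptions; it omits the corrective cycle and the progress check against the proximal-gradient step described above.

```python
# Simplified illustration (not the exact algorithm of the cited papers):
# 1) a proximal-gradient step predicts which variables are zero (fixed)
#    and which are free;
# 2) the smooth model is then minimized over the free variables only,
#    with the l1 term linearized using the predicted signs.
import numpy as np

def soft_threshold(v, t):
    return np.sign(v) * np.maximum(np.abs(v) - t, 0.0)

def active_set_iteration(A, b, x, lam, step):
    grad = A.T @ (A @ x - b)
    x_pg = soft_threshold(x - step * grad, step * lam)   # active-set prediction
    free = np.flatnonzero(x_pg)
    if free.size == 0:
        return x_pg
    A_F = A[:, free]
    s = np.sign(x_pg[free])
    # Subspace step: minimize 0.5*||A_F z - b||^2 + lam * s^T z over z.
    # (Assumes A_F has full column rank; a safeguarded method would
    # regularize or fall back to the proximal-gradient point x_pg.)
    z = np.linalg.solve(A_F.T @ A_F, A_F.T @ b - lam * s)
    x_new = np.zeros_like(x)
    x_new[free] = z
    return x_new
```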
“…A subset of the variables is fixed in a so-called active set, determined by certain mechanisms, and the remaining variables are computed from carefully constructed subproblems. Examples include optimization problems with bound constraints or linear constraints in [17,18,53,82,83], ℓ1-regularized problems for sparse optimization in [65,107,135], and general nonlinear programs in [19,20]. In quadratic programming, the inequality constraints that are satisfied with equality at the optimal solution are called active, and they are replaced by equality constraints in the subproblem [113].…”
Section: BCD
Citation type: mentioning; confidence: 99%
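As a hedged illustration of the quadratic-programming remark above: once the active inequality constraints are treated as equalities, the resulting equality-constrained QP can be solved through its KKT system. The problem data below are made up for the example.

```python
# Illustrative only: solve  min 0.5 x^T H x + g^T x  s.t.  A_eq x = b_eq,
# where A_eq collects the constraints currently held as equalities (the
# "active" ones), via the KKT system  [H  A_eq^T; A_eq  0] [x; lam] = [-g; b_eq].
import numpy as np

def solve_eq_qp(H, g, A_eq, b_eq):
    n, m = H.shape[0], A_eq.shape[0]
    K = np.block([[H, A_eq.T],
                  [A_eq, np.zeros((m, m))]])
    rhs = np.concatenate([-g, b_eq])
    sol = np.linalg.solve(K, rhs)
    return sol[:n], sol[n:]               # primal point, Lagrange multipliers

# Tiny made-up example: the constraint x1 <= 1 is assumed active, so it is
# imposed as the equality x1 = 1; the minimizer is x = [1.0, 2.5].
H = np.diag([2.0, 2.0])
g = np.array([-2.0, -5.0])
A_eq = np.array([[1.0, 0.0]])
b_eq = np.array([1.0])
x, lam = solve_eq_qp(H, g, A_eq, b_eq)
```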
“…These are just two of many algorithms that incorporate working sets to speed up sparse optimization. For lasso-type problems, many additional studies combine working-set (Scheinberg and Tang, 2016; Massias et al., 2017) or active-set (Wen et al., 2012; Solntsev et al., 2015; Keskar et al.) strategies with standard algorithms. Researchers have also applied working sets to many other sparse problems; see e.g.…”
Section: Relation to Prior Algorithms
Citation type: mentioning; confidence: 99%