Frank E. Curtis – Page 3 – Optimization Online

Trust-Region Newton-CG with Strong Second-Order Complexity Guarantees for Nonconvex Optimization

Published: 2019/12/09, Updated: 2020/08/02

Worst-case complexity guarantees for nonconvex optimization algorithms have been a topic of growing interest. Multiple frameworks that achieve the best known complexity bounds among a broad class of first- and second-order strategies have been proposed. These methods have often been designed primarily with complexity guarantees in mind and, as a result, represent a departure from … Read more

A Fully Stochastic Second-Order Trust Region Method

Published: 2019/11/15

Frank E. Curtis

Rui Shi

Nonlinear Optimization, Stochastic Programming, Unconstrained Optimization deep neural networks, finite-sum optimization, machine learning, stochastic newton methods, stochastic optimization, time series forecasting, trust-region methods

A stochastic second-order trust region method is proposed, which can be viewed as a second-order extension of the trust-region-ish (TRish) algorithm proposed by Curtis et al. [INFORMS J. Optim. 1(3) 200–220, 2019]. In each iteration, a search direction is computed by (approximately) solving a trust region subproblem defined by stochastic gradient and Hessian estimates. The … Read more

Limited-Memory BFGS with Displacement Aggregation

Published: 2019/03/08, Updated: 2020/08/25

Albert S. Berahas

Frank E. Curtis

Baoyu Zhou

Nonlinear Optimization, Unconstrained Optimization broyden-fletcher-goldfarb-shanno (bfgs), limited memory bfgs, nonlinear optimization, quasi-newton methods, superlinear convergence

A displacement aggregation strategy is proposed for the curvature pairs stored in a limited-memory BFGS (a.k.a. L-BFGS) method such that the resulting (inverse) Hessian approximations are equal to those that would be derived from a full-memory BFGS method. This means that, if a sufficiently large number of pairs are stored, then an optimization algorithm employing … Read more

Gradient Sampling Methods for Nonsmooth Optimization

Published: 2018/04/29

Nonsmooth Optimization

This paper reviews the gradient sampling methodology for solving nonsmooth, nonconvex optimization problems. An intuitively straightforward gradient sampling algorithm is stated and its convergence properties are summarized. Throughout this discussion, we emphasize the simplicity of gradient sampling as an extension of the steepest descent method for minimizing smooth objectives. We then provide overviews of various … Read more

A Dynamic Penalty Parameter Updating Strategy for Matrix-Free Sequential Quadratic Optimization

Published: 2018/03/25

Constrained Nonlinear Optimization, Nonlinear Optimization alternating direction method, convex composite optimization, coordinate descent methods, exact penalty functions, nonlinear optimization, sequential quadratic programming

This paper focuses on the design of sequential quadratic optimization (commonly known as SQP) methods for solving large-scale nonlinear optimization problems. The most computationally demanding aspect of such an approach is the computation of the search direction during each iteration, for which we consider the use of matrix-free methods. In particular, we develop a method … Read more

ADMM for Multiaffine Constrained Optimization

Published: 2018/02/26, Updated: 2018/08/29

Frank E. Curtis

Wenbo Gao

Donald Goldfarb

Constrained Nonlinear Optimization, Nonsmooth Optimization admm, alternating direction method, multiaffine constraints, nonconvex optimization, nonlinear optimization

We propose an expansion of the scope of the alternating direction method of multipliers (ADMM). Specifically, we show that ADMM, when employed to solve problems with multiaffine constraints that satisfy certain easily verifiable assumptions, converges to the set of constrained stationary points if the penalty parameter in the augmented Lagrangian is sufficiently large. When the … Read more

Concise Complexity Analyses for Trust-Region Methods

Published: 2018/02/21, Updated: 2019/08/26

Frank E. Curtis

Daniel P. Robinson

Zachary Lubberts

Unconstrained Optimization global convergence, nonconvex optimization, nonlinear optimization, trust-region methods, unconstrained optimization, worst-case evaluation complexity, worst-case iteration-complexity

Concise complexity analyses are presented for simple trust region algorithms for solving unconstrained optimization problems. In contrast to a traditional trust region algorithm, the algorithms considered in this paper require certain control over the choice of trust region radius after any successful iteration. The analyses highlight the essential algorithm components required to obtain certain complexity … Read more

Regional Complexity Analysis of Algorithms for Nonconvex Smooth Optimization

Published: 2018/02/03, Updated: 2020/03/11

Frank E. Curtis

Daniel P. Robinson

Nonlinear Optimization, Unconstrained Optimization nonconvex optimization, nonlinear optimization, regularization, trust-region methods, worst-case evaluation complexity, worst-case iteration-complexity

A strategy is proposed for characterizing the worst-case performance of algorithms for solving nonconvex smooth optimization problems. Contemporary analyses characterize worst-case performance by providing, under certain assumptions on an objective function, an upper bound on the number of iterations (or function or derivative evaluations) required until a pth-order stationarity condition is approximately satisfied. This arguably … Read more

A Stochastic Trust Region Algorithm Based on Careful Step Normalization

Published: 2017/12/29, Updated: 2018/06/26

Frank E. Curtis

Katya Scheinberg

Rui Shi

Nonlinear Optimization, Stochastic Programming, Unconstrained Optimization deep neural networks, finite sum minimization, logistic regression, machine learning, stochastic gradient method, stochastic optimization, trust-region methods

An algorithm is proposed for solving stochastic and finite sum minimization problems. Based on a trust region methodology, the algorithm employs normalized steps, at least as long as the norms of the stochastic gradient estimates are within a specified interval. The complete algorithm—which dynamically chooses whether or not to employ normalized steps—is proved to have … Read more

An Accelerated Communication-Efficient Primal-Dual Optimization Framework for Structured Machine Learning

Published: 2017/11/14

Nonlinear Optimization, Parallel Algorithms

Distributed optimization algorithms are essential for training machine learning models on very large-scale datasets. However, they often suffer from communication bottlenecks. Confronting this issue, a communication-efficient primal-dual coordinate ascent framework (CoCoA) and its improved variant CoCoA+ have been proposed, achieving a convergence rate of $\mathcal{O}(1/t)$ for solving empirical risk minimization problems with Lipschitz continuous losses. … Read more