first-order methods – Optimization Online

Provable and Practical Online Learning Rate Adaptation with Hypergradient Descent

Published: 2025/02/14, Updated: 2025/03/16

This paper investigates the convergence properties of the hypergradient descent method (HDM), a 25-year-old heuristic originally proposed for adaptive stepsize selection in stochastic first-order methods. We provide the first rigorous convergence analysis of HDM using the online learning framework of [Gao24] and apply this analysis to develop new state-of-the-art adaptive gradient methods with empirical and … Read more

Gradient Methods with Online Scaling

Published: 2024/11/06, Updated: 2024/11/17

Wenzhi Gao

Convex Optimization, Nonlinear Optimization, Other Topics first-order methods, online convex optimization, preconditioning

We introduce a framework to accelerate the convergence of gradient-based methods with online learning. The framework learns to scale the gradient at each iteration through an online learning algorithm and provably accelerates gradient-based methods asymptotically. In contrast with previous literature, where convergence is established based on worst-case analysis, our framework provides a strong convergence guarantee … Read more

Efficient parameter-free restarted accelerated gradient methods for convex and strongly convex optimization

Published: 2024/10/05, Updated: 2024/10/11

Arnesh Sujanani

Renato D.C. Monteiro

Convex and Nonsmooth Optimization, Convex Optimization, Nonlinear Optimization complexity, convex optimization, first-order methods, parameter free methods, restarted accelerated method, strongly convex optimization

This paper develops a new parameter-free restarted method, namely RPF-SFISTA, and a new parameter-free aggressive regularization method, namely A-REG, for solving strongly convex and convex composite optimization problems, respectively. RPF-SFISTA has the major advantage that it requires no knowledge of both the strong convexity parameter of the entire composite objective and the Lipschitz constant of … Read more

Complexity of Adagrad and other first-order methods for nonconvex optimization problems with bounds constraints

Published: 2024/06/22, Updated: 2024/10/31

Serge Gratton

Sadok Jerad

Philippe L. Toint

Bound-constrained Optimization, Data Science Algorithms, Nonlinear Optimization Adagrad, convergence bounds, evaluation complexity, first-order methods, objective-function-free optimization (OFFO), second-order models

A parametric class of trust-region algorithms for constrained nonconvex optimization is analyzed, where the objective function is never computed. By defining appropriate first-order stationarity criteria, we are able to extend the Adagrad method to the newly considered problem and retrieve the standard complexity rate of the projected gradient method that uses both the gradient and … Read more

The Role of Level-Set Geometry on the Performance of PDHG for Conic Linear Optimization

Published: 2024/06/04, Updated: 2024/07/15

Zikai Xiong

Robert M. Freund

Convex Optimization, Linear Programming, Linear, Cone and Semidefinite Programming condition numbers, conic optimization, Convergence Guarantees, first-order methods, Linear Optimization, numerical experiments

We consider solving huge-scale instances of (convex) conic linear optimization problems, at the scale where matrix-factorization-free methods are attractive or necessary. The restarted primal-dual hybrid gradient method (rPDHG) — with heuristic enhancements and GPU implementation — has been very successful in solving huge-scale linear programming (LP) problems; however its application to more general conic convex … Read more

A Max-Min-Max Algorithm for Large-Scale Robust Optimization

Published: 2024/04/08

Kai Tu

Zhi Chen

Man-Chung Yue

Robust Optimization decision making under uncertainty, first-order methods, Max-min-max problems, robust optimization

Robust optimization (RO) is a powerful paradigm for decision making under uncertainty. Existing algorithms for solving RO, including the reformulation approach and the cutting-plane method, do not scale well, hindering the application of RO to large-scale decision problems. In this paper, we devise a first-order algorithm for solving RO based on a novel max-min-max perspective. … Read more

Computational Guarantees for Restarted PDHG for LP based on “Limiting Error Ratios” and LP Sharpness

Published: 2023/12/27, Updated: 2024/04/29

Zikai Xiong

Robert M. Freund

Convex Optimization, Linear Programming, Linear, Cone and Semidefinite Programming computational complexity, error ratio, first-order methods, Linear Optimization, linear program, PDHG, restarts, Sharpness.

In recent years, there has been growing interest in solving linear optimization problems – or more simply “LP” – using first-order methods in order to avoid the costly matrix factorizations of traditional methods for huge-scale LP instances. The restarted primal-dual hybrid gradient method (PDHG) – together with some heuristic techniques – has emerged as a … Read more

On the Relation Between LP Sharpness and Limiting Error Ratio and Complexity Implications for Restarted PDHG

Published: 2023/12/27, Updated: 2023/12/29

Zikai Xiong

Robert M. Freund

There has been a recent surge in development of first-order methods (FOMs) for solving huge-scale linear programming (LP) problems. The attractiveness of FOMs for LP stems in part from the fact that they avoid costly matrix factorization computation. However, the efficiency of FOMs is significantly influenced – both in theory and in practice – by … Read more

First-order penalty methods for bilevel optimization

Published: 2023/01/04

Zhaosong Lu

Sanyou Mei

Constrained Nonlinear Optimization bilevel optimization, first-order methods, minimax optimization, operation complexity, penalty methods

In this paper we study a class of unconstrained and constrained bilevel optimization problems in which the lower-level part is a convex optimization problem, while the upper-level part is possibly a nonconvex optimization problem. In particular, we propose penalty methods for solving them, whose subproblems turn out to be a structured minimax problem and are … Read more

Fixed-Point Automatic Differentiation of Forward–Backward Splitting Algorithms for Partly Smooth Functions

Published: 2022/09/29

Sheheryar Mehmood

Peter Ochs

Convex and Nonsmooth Optimization, Optimization in Data Science bilevel optimization, convex optimization, first-order methods, partial smoothness, sensitivity analysis

A large class of non-smooth practical optimization problems can be written as minimization of a sum of smooth and partly smooth functions. We consider such structured problems which also depend on a parameter vector and study the problem of differentiating its solution mapping with respect to the parameter which has far reaching applications in sensitivity … Read more