Adagrad – Optimization Online

A Simple First-Order Algorithm for Full-Rank Equality Constrained Optimization

Published: 2025/10/17, Updated: 2025/10/18

A very simple first-order algorithm is proposed for solving nonlinear optimization problems with deterministic nonlinear equality constraints. This algorithm adaptively selects steps in the plane tangent to the constraints or steps that reduce infeasibility, without using a merit function or a filter. The tangent steps are based on the AdaGrad method for unconstrained minimization. The …

Recursive Bound-Constrained AdaGrad with Applications to Multilevel and Domain Decomposition Minimization

Published: 2025/07/15

Serge Gratton

Alena Kopanicakova

Philippe L. Toint

Bound-constrained Optimization, Data Science Algorithms, Nonlinear Optimization Adagrad, bound constraints, complexity, domain decomposition, multilevel optimization, neural network training, PDE-based problems

Two OFFO (Objective-Function Free Optimization) noise-tolerant algorithms are presented that handle bound constraints, inexact gradients and use second-order information when available. The first is a multi-level method exploiting a hierarchical description of the problem and the second is a domain-decomposition method covering the standard additive Schwarz decompositions. Both are generalizations of the first-order AdaGrad …

Fast Stochastic Second-Order Adagrad for Nonconvex Bound-Constrained Optimization

Published: 2025/05/09

Bound-constrained Optimization, Data Science Algorithms, Nonlinear Optimization Adagrad, bound constraints, complexity, objective-function-free optimization (OFFO), second-order information, stochastic nonlinear optimization, stochastic projected gradients

ADAGB2, a generalization of the Adagrad algorithm for stochastic optimization, is introduced, which is also applicable to bound-constrained problems and capable of using second-order information when available. It is shown that, given delta in (0,1) and epsilon in (0,1], the ADAGB2 algorithm needs at most O(epsilon^{-2}) iterations to ensure an epsilon-approximate first-order critical point of …

Complexity of Adagrad and other first-order methods for nonconvex optimization problems with bounds constraints

Published: 2024/06/22, Updated: 2024/10/31

Serge Gratton

Sadok Jerad

Philippe L. Toint

Bound-constrained Optimization, Data Science Algorithms, Nonlinear Optimization Adagrad, convergence bounds, evaluation complexity, first-order methods, objective-function-free optimization (OFFO), second-order models

A parametric class of trust-region algorithms for constrained nonconvex optimization is analyzed, where the objective function is never computed. By defining appropriate first-order stationarity criteria, we are able to extend the Adagrad method to the newly considered problem and retrieve the standard complexity rate of the projected gradient method that uses both the gradient and …
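
To make the idea of a bound-constrained, objective-function-free iteration concrete, here is a minimal sketch in plain Python. It is not the trust-region class analyzed in the paper: it swaps in a plain projected AdaGrad step, and the stationarity measure used is the common projected-gradient quantity ||P(x - g) - x||, which is an assumption about what a suitable first-order criterion could look like. The names `project`, `projected_adagrad`, `criticality`, `grad_example` and the parameter `varsigma` are hypothetical.

```python
import math

def project(x, lower, upper):
    """Componentwise projection of x onto the box [lower, upper]."""
    return [min(max(xi, lo), hi) for xi, lo, hi in zip(x, lower, upper)]

def projected_adagrad(grad, x0, lower, upper, iters=100, varsigma=1e-2):
    """Illustrative projected AdaGrad: only grad(x) is called, never f(x)."""
    x = project(x0, lower, upper)
    acc = [varsigma] * len(x)  # running sums of squared gradient components
    for _ in range(iters):
        g = grad(x)
        acc = [a + gi * gi for a, gi in zip(acc, g)]
        # AdaGrad-scaled step, then projection back onto the bounds
        trial = [xi - gi / math.sqrt(a) for xi, gi, a in zip(x, g, acc)]
        x = project(trial, lower, upper)
    return x

def criticality(grad, x, lower, upper):
    """Assumed stationarity measure: norm of P(x - g) - x."""
    g = grad(x)
    p = project([xi - gi for xi, gi in zip(x, g)], lower, upper)
    return math.sqrt(sum((pi - xi) ** 2 for pi, xi in zip(p, x)))

def grad_example(x):
    # Gradient of the hypothetical objective 0.5*(x0-3)^2 + 0.5*(x1+1)^2
    return [x[0] - 3.0, x[1] + 1.0]

lower, upper = [0.0, 0.0], [1.0, 1.0]
x_box = projected_adagrad(grad_example, [0.5, 0.5], lower, upper)
measure = criticality(grad_example, x_box, lower, upper)
```

On this toy problem the unconstrained minimizer (3, -1) lies outside the box, so both bounds become active and the measure vanishes even though the gradient itself does not.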

OFFO minimization algorithms for second-order optimality and their complexity

Published: 2022/03/07

Serge Gratton

Philippe L. Toint

Unconstrained Optimization Adagrad, evaluation complexity, global rate of convergence, objective-function-free optimization (OFFO), second-order optimality

An Adagrad-inspired class of algorithms for smooth unconstrained optimization is presented in which the objective function is never evaluated and yet the gradient norms decrease at least as fast as O(1/\sqrt{k+1}) while second-order optimality measures converge to zero at least as fast as O(1/(k+1)^{1/3}). This latter rate of convergence is shown to be essentially sharp …

Parametric complexity analysis for a class of first-order Adagrad-like algorithms

Published: 2022/03/07

Serge Gratton

Sadok Jerad

Philippe L. Toint

Stochastic Programming, Unconstrained Optimization Adagrad, convergence bounds, evaluation complexity, first-order methods, noisy gradients, objective-function-free optimization

A class of algorithms for optimization in the presence of noise is presented that does not require the evaluation of the objective function. This class generalizes the well-known Adagrad method. The complexity of this class is then analyzed as a function of its parameters, and it is shown that some methods of the class enjoy …

First-Order Objective-Function-Free Optimization Algorithms and Their Complexity

Published: 2022/03/07

Serge Gratton

Sadok Jerad

Philippe L. Toint

Unconstrained Optimization Adagrad, convergence bounds, evaluation complexity, first-order methods, objective-function-free optimization (OFFO), second-order models

A class of algorithms for unconstrained nonconvex optimization is considered where the value of the objective function is never computed. The class contains a deterministic version of the first-order Adagrad method typically used for minimization of noisy functions, but also allows the use of second-order information when available. The rate of convergence of methods in …
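
As a rough illustration of what "the objective function is never computed" means in practice, the following sketch implements the standard componentwise AdaGrad update in its deterministic form, using only gradient calls. It covers the first-order case only and does not attempt the second-order variants mentioned in the abstract. The function names, the test gradient and the small positive constant `varsigma` are assumptions made for the example.

```python
import math

def adagrad_offo(grad, x0, iters=500, varsigma=1e-2):
    """Deterministic AdaGrad-style iteration that only ever calls grad(x)."""
    x = list(x0)
    acc = [varsigma] * len(x)  # running sums of squared gradient components
    for _ in range(iters):
        g = grad(x)
        for i, gi in enumerate(g):
            acc[i] += gi * gi
            # Step scaled by the accumulated gradient history; no f(x) needed
            x[i] -= gi / math.sqrt(acc[i])
    return x

def grad_quadratic(x):
    # Gradient of the hypothetical objective 0.5*(x0-1)^2 + 2*(x1+2)^2
    return [x[0] - 1.0, 4.0 * (x[1] + 2.0)]

x_final = adagrad_offo(grad_quadratic, [5.0, 5.0])
grad_norm = math.sqrt(sum(gi * gi for gi in grad_quadratic(x_final)))
```

Because the step is determined entirely by past gradients, there is no line search or acceptance test, which is why no function value is required at any point.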