Fast and Faster Convergence of SGD for Over-Parameterized Models and an Accelerated Perceptron

Modern machine learning focuses on highly expressive models that are able to fit or interpolate the data completely, resulting in zero training loss. For such models, we show that the stochastic gradients of common loss functions satisfy a strong growth condition. Under this condition, we prove that constant step-size stochastic gradient descent (SGD) with Nesterov …
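
As a rough illustration of the setting described above, the sketch below runs constant step-size SGD with a Nesterov-style momentum step on a small over-parameterized least-squares problem that can be interpolated exactly. The problem sizes, step size, and momentum value are illustrative assumptions, not the constants analyzed in the paper.

```python
import numpy as np

# Minimal sketch (not the paper's algorithm): constant step-size SGD with a
# Nesterov-style momentum step on an over-parameterized least-squares problem
# that the model can interpolate, so zero training loss is attainable.
rng = np.random.default_rng(0)
n, d = 50, 200                          # fewer samples than parameters
A = rng.standard_normal((n, d))
b = A @ rng.standard_normal(d)          # consistent system: the minimum loss is 0

def stoch_grad(x, i):
    """Gradient of the i-th term 0.5 * (a_i^T x - b_i)^2."""
    return (A[i] @ x - b[i]) * A[i]

eta, momentum = 1e-3, 0.9               # illustrative constant step size and momentum
x = np.zeros(d)
v = np.zeros(d)
for _ in range(20000):
    i = rng.integers(n)
    lookahead = x + momentum * v        # Nesterov: evaluate the gradient at a look-ahead point
    v = momentum * v - eta * stoch_grad(lookahead, i)
    x = x + v

print("training loss:", 0.5 * np.mean((A @ x - b) ** 2))
```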

Regularized Nonlinear Acceleration

We describe a convergence acceleration technique for generic optimization problems. Our scheme computes estimates of the optimum from a nonlinear average of the iterates produced by any optimization method. The weights in this average are computed via a simple linear system, whose solution can be updated online. This acceleration scheme runs in parallel to the …
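
The sketch below illustrates the general idea of extrapolating from a nonlinear (weighted) average of iterates, with weights obtained from a small regularized linear system built from iterate differences. The regularization value, normalization, and test problem are assumptions for illustration; the paper's scheme and analysis differ in the details.

```python
import numpy as np

# Hedged sketch of the idea: combine past iterates with weights that solve a
# small regularized linear system over the iterate differences (residuals).
def extrapolate(xs, lam=1e-6):
    """xs: list of iterates x_0, ..., x_k produced by any optimization method."""
    X = np.stack(xs, axis=1)                 # d x (k+1)
    R = X[:, 1:] - X[:, :-1]                 # residual matrix, d x k
    RtR = R.T @ R
    RtR /= np.linalg.norm(RtR)               # normalize the scale before regularizing
    k = RtR.shape[0]
    z = np.linalg.solve(RtR + lam * np.eye(k), np.ones(k))
    c = z / z.sum()                          # weights summing to one
    return X[:, :-1] @ c                     # weighted (nonlinear) average of iterates

# Usage sketch: run a few gradient-descent steps on a quadratic, then extrapolate.
rng = np.random.default_rng(0)
Q = rng.standard_normal((20, 20)); Q = Q.T @ Q + np.eye(20)
b = rng.standard_normal(20)
h = 1.0 / np.linalg.norm(Q, 2)               # safe step size from the spectral norm
x = np.zeros(20); xs = [x.copy()]
for _ in range(10):
    x = x - h * (Q @ x - b)
    xs.append(x.copy())
x_acc = extrapolate(xs)
print(np.linalg.norm(Q @ xs[-1] - b), np.linalg.norm(Q @ x_acc - b))
```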

Integration Methods and Accelerated Optimization Algorithms

We show that accelerated optimization methods can be seen as particular instances of multi-step integration schemes from numerical analysis, applied to the gradient flow equation. In comparison with recent advances in this vein, the differential equation considered here is the basic gradient flow and we show that multi-step schemes allow integration of this differential equation …
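
As a rough illustration of reading optimization methods as integrators of the gradient flow $x'(t) = -\nabla f(x(t))$, the sketch below compares forward Euler (which is plain gradient descent) with a two-step scheme that combines the last two iterates, on a small quadratic. The coefficients beta and gamma are illustrative assumptions, not the schemes derived in the paper.

```python
import numpy as np

# Hedged illustration: forward Euler on the gradient flow recovers gradient
# descent; a two-step scheme that reuses the previous iterate resembles
# momentum-type updates. Coefficients below are assumptions for illustration.
def grad(x, Q, b):
    return Q @ x - b                         # gradient of f(x) = 0.5 x^T Q x - b^T x

rng = np.random.default_rng(0)
Q = rng.standard_normal((10, 10)); Q = Q.T @ Q + np.eye(10)
b = rng.standard_normal(10)
h = 1.0 / np.linalg.norm(Q, 2)               # step size from the spectral norm

# One-step scheme (forward Euler) = gradient descent: x_{k+1} = x_k - h * grad(x_k)
x = np.zeros(10)
for _ in range(200):
    x = x - h * grad(x, Q, b)

# Two-step scheme (hypothetical coefficients beta, gamma):
# x_{k+1} = (1 + beta) x_k - beta x_{k-1} - gamma * h * grad(x_k)
beta, gamma = 0.8, 1.0
x_prev = np.zeros(10); x_cur = np.zeros(10)
for _ in range(200):
    x_next = (1 + beta) * x_cur - beta * x_prev - gamma * h * grad(x_cur, Q, b)
    x_prev, x_cur = x_cur, x_next

x_star = np.linalg.solve(Q, b)
print(np.linalg.norm(x - x_star), np.linalg.norm(x_cur - x_star))
```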

A Stochastic Gradient Method with an Exponential Convergence Rate for Strongly-Convex Optimization with Finite Training Sets

We propose a new stochastic gradient method for optimizing the sum of a finite set of smooth functions, where the sum is strongly convex. While standard stochastic gradient methods converge at sublinear rates for this problem, the proposed method incorporates a memory of previous gradient values in order to achieve a linear convergence rate. Numerical …
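
The sketch below illustrates the "memory of previous gradient values" idea on a small regularized least-squares problem: one gradient per example is stored and each step follows the average of the stored gradients. The step-size rule, problem, and regularization value are illustrative assumptions, not the paper's exact method or constants.

```python
import numpy as np

# Hedged sketch of a gradient-memory method: keep the last gradient seen for
# each training example and step along the running average of the stored
# gradients, refreshing one entry per iteration.
rng = np.random.default_rng(0)
n, d = 100, 20
A = rng.standard_normal((n, d))
b = A @ rng.standard_normal(d) + 0.1 * rng.standard_normal(n)
reg = 0.1                                   # L2 term to make the sum strongly convex

def grad_i(x, i):
    """Gradient of the i-th term 0.5 * (a_i^T x - b_i)^2 + 0.5 * reg * ||x||^2."""
    return (A[i] @ x - b[i]) * A[i] + reg * x

L = np.max(np.sum(A ** 2, axis=1)) + reg    # per-example smoothness bound
eta = 1.0 / (16 * L)                        # conservative constant step size (assumption)
memory = np.zeros((n, d))                   # last gradient seen for each example
avg = np.zeros(d)                           # running average of the stored gradients
x = np.zeros(d)
for _ in range(20000):
    i = rng.integers(n)
    g = grad_i(x, i)
    avg += (g - memory[i]) / n              # swap example i's entry into the average
    memory[i] = g
    x = x - eta * avg

full_grad = A.T @ (A @ x - b) / n + reg * x
print("full gradient norm:", np.linalg.norm(full_grad))
```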