weighted averaging – Optimization Online

Optimized convergence of stochastic gradient descent by weighted averaging

Published: 2022/09/23, Updated: 2022/10/05

Convex Optimization, Data Science Theory, Stochastic Approaches convex optimization, noise, optimal step lengths, optimal weights, stochastic gradient descent, weighted averaging

Under mild assumptions stochastic gradient methods asymptotically achieve an optimal rate of convergence if the arithmetic mean of all iterates is returned as an approximate optimal solution. However, in the absence of stochastic noise, the arithmetic mean of all iterates converges considerably slower to the optimal solution than the iterates themselves. And also in the … Read more