Stochastic convergence of parallel asynchronous adaptive first-order methods

Published: 2026/06/01, Updated: 2026/06/24

Categories: Data Science Algorithms, Stochastic Programming, Unconstrained Optimization
Keywords: asynchronous methods, first-order methods, global rate of convergence, heterogeneous deep learning problems, parallel computing, Unconstrained nonconvex optimization
Short URL: https://optimization-online.org/?p=35029

A new class of asynchronous adaptive first-order optimization methods is introduced, comprising asynchronous variants of several popular
algorithms. Versions of these methods using momentum and/or inexact normalization are also considered. The convergence of methods in the
class on non-convex functions is analyzed in a fully stochastic setting, and is shown to be (up to logarithmic factors) of order $\mathcal{O}(1/\sqrt{t})$ under
reasonable assumptions. Numerical experiments suggest that such asynchronous adaptive algorithms are very relevant in heterogeneous large-scale machine learning systems.
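
As a rough illustration of what an asynchronous adaptive first-order iteration can look like, the following minimal sketch simulates an AdaGrad-type update in which each stochastic gradient is evaluated at a stale iterate. It is not the method of the paper: the test function, the bounded random delay model, the noise level, and all names (`async_adagrad`, `max_delay`, `noise`) are assumptions made only for this example.

```python
import math
import random


def grad(x):
    """Gradient of the nonconvex test function f(x) = sum_i x_i^2 / (1 + x_i^2)."""
    return [2.0 * xi / (1.0 + xi * xi) ** 2 for xi in x]


def grad_norm(x):
    """Euclidean norm of the exact gradient at x."""
    return math.sqrt(sum(gi * gi for gi in grad(x)))


def async_adagrad(x0, steps=2000, alpha=0.1, max_delay=3, noise=0.01, seed=0):
    """Simulate an AdaGrad-type method fed with delayed, noisy gradients."""
    rng = random.Random(seed)
    history = [list(x0)]   # past iterates, kept so that stale reads are possible
    x = list(x0)
    v = [0.0] * len(x0)    # running sum of squared gradient components
    eps = 1e-12            # guards against division by zero
    for t in range(steps):
        # Assumed delay model: the gradient was computed at an iterate
        # that is between 0 and max_delay steps old.
        delay = rng.randint(0, max_delay)
        stale = history[max(0, t - delay)]
        # Stochastic gradient: exact gradient at the stale point plus Gaussian noise.
        g = [gi + rng.gauss(0.0, noise) for gi in grad(stale)]
        for i, gi in enumerate(g):
            v[i] += gi * gi
            x[i] -= alpha * gi / math.sqrt(v[i] + eps)
        history.append(list(x))
    return x


x0 = [1.0, -0.8, 0.5]
x_final = async_adagrad(x0)
print("initial gradient norm:", grad_norm(x0))
print("final gradient norm:  ", grad_norm(x_final))
```

In a real parallel implementation the delay would come from workers reading and writing shared parameters at different speeds. Here it is drawn at random so that the sketch stays sequential and reproducible.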
