distributed learning – Optimization Online

Single-Timescale Multi-Sequence Stochastic Approximation Without Fixed Point Smoothness: Theories and Applications

Published: 2024/10/18

Categories: Nonlinear Optimization, Stochastic Programming
Tags: bilevel optimization, convergence analysis, distributed learning, stochastic approximation

Stochastic approximation (SA) that involves multiple coupled sequences, known as multiple-sequence SA (MSSA), finds diverse applications in the fields of signal processing and machine learning. However, existing theoretical understandings of MSSA are limited: the multi-timescale analysis implies a slow convergence rate, whereas the single-timescale analysis relies on a stringent fixed point smoothness assumption. This …

A Simplified Convergence Theory for Byzantine Resilient Stochastic Gradient Descent

Published: 2022/09/01

Lindon Roberts

Categories: Optimization in Data Science, Stochastic Programming
Tags: Byzantine resilience, distributed learning, stochastic optimization

In distributed learning, a central server trains a model according to updates provided by nodes holding local data samples. In the presence of one or more malicious servers sending incorrect information (a Byzantine adversary), standard algorithms for model training such as stochastic gradient descent (SGD) fail to converge. In this paper, we present a simplified …
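
The setting in this abstract can be illustrated with a toy simulation. The sketch below is not the method analysed in the paper: the one-dimensional quadratic objective, the constant attack value, the helper names `honest_gradient` and `run_sgd`, and the use of a median as a robust aggregator are all assumptions chosen only to show how a single incorrect update can pull plain averaged SGD far from the minimizer.

```python
import random
from statistics import median


def honest_gradient(w, rng):
    # Noisy gradient of the assumed objective f(w) = 0.5 * (w - 3)^2,
    # whose minimizer is w = 3.
    return (w - 3.0) + rng.gauss(0.0, 0.1)


def mean(values):
    return sum(values) / len(values)


def run_sgd(aggregate, n_honest=9, n_byzantine=1, steps=200, lr=0.1, seed=0):
    """Server-side SGD: collect one update per node, aggregate, take a step."""
    rng = random.Random(seed)
    w = 0.0
    for _ in range(steps):
        updates = [honest_gradient(w, rng) for _ in range(n_honest)]
        # Hypothetical attack: every Byzantine node reports a large bogus value.
        updates += [-1000.0] * n_byzantine
        w -= lr * aggregate(updates)
    return w


w_mean = run_sgd(mean)      # plain averaging, as in standard distributed SGD
w_median = run_sgd(median)  # a simple robust aggregator, used here for contrast

print("mean aggregation:  ", w_mean)
print("median aggregation:", w_median)
```

With these assumed settings, the averaged iterate settles far from the minimizer at 3, while the median-aggregated iterate stays close to it. Against a constant attack the averaged run converges to the wrong point rather than diverging; a stronger adversary could choose updates that prevent convergence altogether.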

Optimal Distributed Online Prediction using Mini-Batches

Published: 2011/02/18

Categories: Convex and Nonsmooth Optimization, Data-Mining, Stochastic Programming
Tags: distributed learning, online convex optimization, parallel computing, stochastic optimization

Online prediction methods are typically presented as serial algorithms running on a single processor. However, in the age of web-scale prediction problems, it is increasingly common to encounter situations where a single processor cannot keep up with the high rate at which inputs arrive. In this work we present the distributed mini-batch algorithm, a method …