stochastic approximation – Optimization Online

Stochastic Approximation with Block Coordinate Optimal Stepsizes

Published: 2025/07/16

Data Science Algorithms, Optimization in Data Science adaptive algorithms, block coordinate stepsizes, stochastic approximation

We consider stochastic approximation with block-coordinate stepsizes and propose adaptive stepsize rules that aim to minimize the expected distance from the next iterate to an optimal point. These stepsize rules employ online estimates of the second moment of the search direction along each block coordinate. The popular Adam algorithm can be interpreted as a particular …

Multi-cut stochastic approximation methods for solving stochastic convex composite optimization

Published: 2025/05/22, Updated: 2026/03/02

Jiaming Liang

Honghao Zhang

Renato D.C. Monteiro

Convex Optimization, Stochastic Programming optimal complexity bound, proximal bundle method, stochastic approximation, stochastic convex composite optimization

This paper considers the stochastic convex composite optimization problem and presents multi-cut stochastic approximation (SA) methods for solving it, whose models in expectation overestimate its objective function. The multi-cut model obtained by taking the maximum of a finite number of linearizations of the stochastic objective function provides a biased estimate of the objective function, with …

Single-Timescale Multi-Sequence Stochastic Approximation Without Fixed Point Smoothness: Theories and Applications

Published: 2024/10/18

Nonlinear Optimization, Stochastic Programming bilevel optimization, convergence analysis, distributed learning, stochastic approximation

Stochastic approximation (SA) that involves multiple coupled sequences, known as multiple-sequence SA (MSSA), finds diverse applications in the fields of signal processing and machine learning. However, existing theoretical understandings of MSSA are limited: the multi-timescale analysis implies a slow convergence rate, whereas the single-timescale analysis relies on a stringent fixed point smoothness assumption. This paper …

A Quasi-Newton Algorithm for Optimal Discretization of Markov Processes

Published: 2023/03/29

Nils Löhndorf

David Wozabal

Stochastic Programming clustering, Quasi-Newton method, scenario generation, scenario lattices, stochastic approximation, stochastic dual dynamic programming

In stochastic programming and stochastic-dynamic programming, discretization of random model parameters is often unavoidable. We propose a quasi-Newton learning algorithm to discretize multi-dimensional, continuous discrete-time Markov processes to scenario lattices by minimizing the Wasserstein distance between the unconditional distributions of process and lattice. Scenario lattices enable accurate discretization of the conditional distributions of Markov processes …

Stochastic Multi-level Composition Optimization Algorithms with Level-Independent Convergence Rates

Published: 2020/09/07

Krishnakumar Balasubramanian

Saeed Ghadimi

Anthony Nguyen

Nonlinear Optimization, Statistics, Stochastic Programming deep learning, nonconvex optimization, stochastic approximation, stochastic composition optimization

In this paper, we study smooth stochastic multi-level composition optimization problems, where the objective function is a nested composition of $T$ functions. We assume access to noisy evaluations of the functions and their gradients, through a stochastic first-order oracle. For solving this class of problems, we propose two algorithms using moving-average stochastic estimates, and analyze …

Accuracy and fairness trade-offs in machine learning: A stochastic multi-objective approach

Published: 2020/08/03, Updated: 2020/09/03

Suyun Liu

Luis Nunes Vicente

Data-Mining, Multi-Criteria Optimization, Stochastic Programming disparate impact, equal opportunity, fairness, machine learning, multi-objective optimization, nonconvex optimization, Pareto fronts, sensitive/protected attributes, stochastic approximation, supervised learning

In the application of machine learning to real-life decision-making systems, e.g., credit scoring and criminal justice, the prediction outcomes might discriminate against people with sensitive attributes, leading to unfairness. The commonly used strategy in fair machine learning is to include fairness as a constraint or a penalization term in the minimization of the prediction …

Penalized stochastic gradient methods for stochastic convex optimization with expectation constraints

Published: 2019/09/12

Xiantao Xiao

Stochastic Programming convergence analysis, expectation constraints, numerical experiments, stochastic approximation, stochastic convex optimization

The stochastic gradient method and its variants are simple yet effective for minimizing an expectation function over a closed convex set. However, none of these methods is applicable to stochastic programs with expectation constraints, since the projection onto the feasible set is prohibitive. To deal with expectation-constrained stochastic convex optimization problems, we propose …

Normal Approximation for Stochastic Gradient Descent via Non-Asymptotic Rates of Martingale CLT

Published: 2019/04/09

Andreas Anastasiou

Krishnakumar Balasubramanian

Murat A. Erdogdu

Nonlinear Optimization, Stochastic Programming normal approximation, rates of martingale CLT, stochastic approximation

We provide non-asymptotic convergence rates of the Polyak-Ruppert averaged stochastic gradient descent (SGD) to a normal random vector for a class of twice-differentiable test functions. A crucial intermediate step is proving a non-asymptotic martingale central limit theorem (CLT), i.e., establishing the rates of convergence of a multivariate martingale difference sequence to a normal random vector, …
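
For readers unfamiliar with the term, Polyak-Ruppert averaging runs ordinary SGD and reports the running mean of the iterates instead of the last iterate. The following is a minimal sketch of that general technique only, not the analysis or setup of the paper above. The noisy quadratic oracle, the helper names (averaged_sgd, make_noisy_quadratic), the step-size schedule 0.5 / k^0.6, and the step count are all assumptions chosen for illustration.

```python
import random


def averaged_sgd(grad_oracle, x0, steps, step_size):
    """Run SGD; return (last iterate, Polyak-Ruppert average of the iterates)."""
    x = list(x0)
    avg = [0.0] * len(x)
    for k in range(1, steps + 1):
        g = grad_oracle(x)          # noisy gradient at the current iterate
        eta = step_size(k)          # step size for iteration k
        x = [xi - eta * gi for xi, gi in zip(x, g)]
        # Running mean of x_1, ..., x_k, updated incrementally.
        avg = [ai + (xi - ai) / k for ai, xi in zip(avg, x)]
    return x, avg


def make_noisy_quadratic(target, noise_std, rng):
    """Hypothetical oracle: gradient of 0.5 * ||x - target||^2 plus Gaussian noise."""
    def oracle(x):
        return [(xi - ti) + rng.gauss(0.0, noise_std) for xi, ti in zip(x, target)]
    return oracle


rng = random.Random(0)              # fixed seed so the run is reproducible
target = [1.0, -2.0]                # assumed minimizer of the toy objective
oracle = make_noisy_quadratic(target, noise_std=1.0, rng=rng)
last, avg = averaged_sgd(
    oracle,
    x0=[0.0, 0.0],
    steps=20000,
    step_size=lambda k: 0.5 / k ** 0.6,  # assumed slowly decaying schedule
)
print("last iterate:    ", last)
print("averaged iterate:", avg)
```

On this toy problem the averaged iterate should land close to the assumed minimizer; the distribution of its scaled error is the kind of quantity a normal approximation result describes.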

A Single Time-Scale Stochastic Approximation Method for Nested Stochastic Optimization

Published: 2018/12/27

Saeed Ghadimi

Andrzej Ruszczyński

Mengdi Wang

Nonlinear Optimization, Stochastic Programming nested stochastic optimization, nonconvex optimization, stochastic approximation

We study constrained nested stochastic optimization problems in which the objective function is a composition of two smooth functions whose exact values and derivatives are not available. We propose a single time-scale stochastic approximation algorithm, which we call the Nested Averaged Stochastic Approximation (NASA), to find an approximate stationary point of the problem. The algorithm …

A stochastic approximation method for approximating the efficient frontier of chance-constrained nonlinear programs

Published: 2018/12/17, Updated: 2020/05/28

Rohit Kannan

James R. Luedtke

Stochastic Programming chance constraints, efficient frontier, stochastic approximation, stochastic subgradient

We propose a stochastic approximation method for approximating the efficient frontier of chance-constrained nonlinear programs. Our approach is based on a bi-objective viewpoint of chance-constrained programs that seeks solutions on the efficient frontier of optimal objective value versus risk of constraint violation. To this end, we construct a reformulated problem whose objective is to minimize …