stochastic gradient method – Optimization Online

Expected Value of Matrix Quadratic Forms with Wishart distributed Random Matrices

Published: 2022/12/02, Updated: 2022/12/13

Convex Optimization, Data Science Theory, Stochastic Approaches averaging, expected value, quadratic form, second momentum, stochastic gradient method, Wishart distribution

To explore the limits of a stochastic gradient method, it may be useful to consider an example consisting of an infinite number of quadratic functions. In this context, it is appropriate to determine the expected value and the covariance matrix of the stochastic noise, i.e. the difference of the true gradient and the approximated gradient … Read more

A Stochastic Trust Region Algorithm Based on Careful Step Normalization

Published: 2017/12/29, Updated: 2018/06/26

Frank E. Curtis

Katya Scheinberg

Rui Shi

Nonlinear Optimization, Stochastic Programming, Unconstrained Optimization deep neural networks, finite sum minimization, logistic regression, machine learning, stochastic gradient method, stochastic optimization, trust-region methods

An algorithm is proposed for solving stochastic and finite sum minimization problems. Based on a trust region methodology, the algorithm employs normalized steps, at least as long as the norms of the stochastic gradient estimates are within a specified interval. The complete algorithm—which dynamically chooses whether or not to employ normalized steps—is proved to have … Read more

A Proximal Stochastic Gradient Method with Progressive Variance Reduction

Published: 2014/03/19

Lin Xiao

Tong Zhang

Convex and Nonsmooth Optimization, Stochastic Programming incremental gradient method, proximal gradient method, stochastic gradient method

We consider the problem of minimizing the sum of two convex functions: one is the average of a large number of smooth component functions, and the other is a general convex function that admits a simple proximal mapping. We assume the whole objective function is strongly convex. Such problems often arise in machine learning, known … Read more