Damek Davis – Page 2 – Optimization Online

Stochastic subgradient method converges on tame functions

Published: 2018/05/25

Nonsmooth Optimization differential inclusion, lyapunov function, semialgebraic, stochastic subgradient method, subdifferential

This work considers the question: what convergence guarantees does the stochastic subgradient method have in the absence of smoothness and convexity? We prove that the stochastic subgradient method, on any semialgebraic locally Lipschitz function, produces limit points that are all first-order stationary. More generally, our result applies to any function with a Whitney stratifiable graph. …
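To make the setting concrete, here is a minimal sketch of a stochastic subgradient iteration on a toy semialgebraic, locally Lipschitz function. The function $f(x) = \big||x| - 1\big|$, the Gaussian noise model, and the step sizes $0.5/(k+1)^{3/4}$ are illustrative assumptions, not taken from the paper.

```python
import random


def subgradient(x):
    """Return one Clarke subgradient of the toy function f(x) = | |x| - 1 |."""
    outer = (abs(x) > 1) - (abs(x) < 1)  # sign of |x| - 1
    inner = (x > 0) - (x < 0)            # sign of x
    return float(outer * inner)          # 0.0 at the kinks, which is a valid choice


def stochastic_subgradient(x0, iters=5000, noise=0.1, seed=0):
    """Run x_{k+1} = x_k - a_k * (g_k + noise_k) with diminishing steps a_k."""
    rng = random.Random(seed)
    x = x0
    for k in range(iters):
        # Assumed schedule: not summable, but square summable.
        step = 0.5 / (k + 1) ** 0.75
        g = subgradient(x) + rng.gauss(0.0, noise)  # noisy subgradient estimate
        x -= step * g
    return x


x_final = stochastic_subgradient(3.0)
# Distance from the final iterate to the stationary points x = 1 and x = -1.
gap = abs(abs(x_final) - 1.0)
```

In this toy run the iterate settles near the minimizer $x = 1$. The function also has a stationary point at $x = 0$, a local maximum, which is a reminder that first-order stationarity is weaker than local optimality.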

Stochastic model-based minimization of weakly convex functions

Published: 2018/03/29

Damek Davis

Dmitriy Drusvyatskiy

Convex and Nonsmooth Optimization moreau envelope, prox-linear method, proximal point algorithm, stochastic, subgradient method, weakly convex

We consider an algorithm that successively samples and minimizes stochastic models of the objective function. We show that under weak-convexity and Lipschitz conditions, the algorithm drives the expected norm of the gradient of the Moreau envelope to zero at the rate $O(k^{-1/4})$. Our result yields the first complexity guarantees for the stochastic proximal point algorithm …
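For reference, the stationarity measure named in this abstract can be written out as follows. The symbols $\varphi$, $\rho$, and $\lambda$ are notation assumed here for illustration; the abstract excerpt does not fix them. For a $\rho$-weakly convex function $\varphi$ and a parameter $0 < \lambda < 1/\rho$, the Moreau envelope and proximal map are

$$
\varphi_\lambda(x) = \min_{y}\Big\{\varphi(y) + \frac{1}{2\lambda}\|y - x\|^2\Big\},
\qquad
\operatorname{prox}_{\lambda\varphi}(x) = \operatorname*{argmin}_{y}\Big\{\varphi(y) + \frac{1}{2\lambda}\|y - x\|^2\Big\},
$$

and the envelope is continuously differentiable with

$$
\nabla\varphi_\lambda(x) = \frac{1}{\lambda}\big(x - \operatorname{prox}_{\lambda\varphi}(x)\big).
$$

Writing $\hat{x} = \operatorname{prox}_{\lambda\varphi}(x)$, a small envelope gradient certifies that $x$ is close to a nearly stationary point:

$$
\|\hat{x} - x\| = \lambda\,\|\nabla\varphi_\lambda(x)\|,
\qquad
\varphi(\hat{x}) \le \varphi(x),
\qquad
\operatorname{dist}\big(0;\,\partial\varphi(\hat{x})\big) \le \|\nabla\varphi_\lambda(x)\|.
$$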

Stochastic subgradient method converges at the rate $O(k^{-1/4})$ on weakly convex functions

Published: 2018/02/08, Updated: 2018/02/13

Damek Davis

Dmitriy Drusvyatskiy

Nonsmooth Optimization moreau envelope, stochastic, subgradient method

We prove that the projected stochastic subgradient method, applied to a weakly convex problem, drives the gradient of the Moreau envelope to zero at the rate $O(k^{-1/4})$.

The nonsmooth landscape of phase retrieval

Published: 2017/11/09

Damek Davis

Dmitriy Drusvyatskiy

Courtney Paquette

Nonsmooth Optimization eigenvalues, phase retrieval, spectral functions, stationary points, subdifferential, subgradient method, variational principle

We consider a popular nonsmooth formulation of the real phase retrieval problem. We show that under standard statistical assumptions, a simple subgradient method converges linearly when initialized within a constant relative distance of an optimal solution. Seeking to understand the distribution of the stationary points of the problem, we complete the paper by proving that …
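The following is a minimal sketch of a subgradient method on one common nonsmooth phase retrieval objective, $f(x) = \frac{1}{m}\sum_i |\langle a_i, x\rangle^2 - b_i|$. The abstract excerpt does not state the objective or the step-size rule, so the objective above, the Polyak step (which uses the fact that $\min f = 0$ for noiseless data), the Gaussian measurements, and all names below are assumptions made for illustration.

```python
import math
import random


def loss_and_subgradient(x, A, b):
    """Return f(x) = (1/m) * sum_i |<a_i, x>^2 - b_i| and one subgradient of f at x."""
    m, d = len(A), len(x)
    f, g = 0.0, [0.0] * d
    for a_i, b_i in zip(A, b):
        inner = sum(a_ij * x_j for a_ij, x_j in zip(a_i, x))
        r = inner * inner - b_i
        f += abs(r) / m
        s = (r > 0) - (r < 0)  # sign of the residual, 0 at a kink
        for j in range(d):
            g[j] += s * 2.0 * inner * a_i[j] / m
    return f, g


def polyak_subgradient(x, A, b, iters=500):
    """Subgradient method with the Polyak step (f(x) - min f) / ||g||^2, min f = 0."""
    for _ in range(iters):
        f, g = loss_and_subgradient(x, A, b)
        g_sq = sum(g_j * g_j for g_j in g)
        if f == 0.0 or g_sq == 0.0:
            break  # exact solution reached, or no direction available
        step = f / g_sq
        x = [x_j - step * g_j for x_j, g_j in zip(x, g)]
    return x


def dist(u, v):
    return math.sqrt(sum((u_j - v_j) ** 2 for u_j, v_j in zip(u, v)))


rng = random.Random(0)
d, m = 5, 60
x_true = [1.0, -0.5, 0.25, 0.0, 0.75]  # hypothetical signal
A = [[rng.gauss(0.0, 1.0) for _ in range(d)] for _ in range(m)]
b = [sum(a_ij * x_j for a_ij, x_j in zip(a_i, x_true)) ** 2 for a_i in A]

# Start within a small relative distance of the signal, as the local result requires.
x0 = [x_j + 0.05 * rng.gauss(0.0, 1.0) for x_j in x_true]
x_hat = polyak_subgradient(x0, A, b)

# The signal is identifiable only up to a global sign.
error = min(dist(x_hat, x_true), dist(x_hat, [-x_j for x_j in x_true]))
```

The error is measured against both $x$ and $-x$ because squared measurements cannot distinguish a signal from its negation.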

A SMART Stochastic Algorithm for Nonconvex Optimization with Applications to Robust Machine Learning

Published: 2016/09/21, Updated: 2016/10/04

Aleksandr Y. Aravkin

Damek Davis

Nonlinear Optimization, Nonsmooth Optimization, Statistics machine learning, nonconvex optimization, saga, smart, svrg, trimmed estimators, variance reduction

Machine learning theory typically assumes that training data is unbiased and not adversarially generated. When real training data deviates from these assumptions, trained models make erroneous predictions, sometimes with disastrous effects. Robust losses, such as the Huber norm, are designed to mitigate the effects of such contaminated data, but they are limited to the regression …

The Sound of APALM Clapping: Faster Nonsmooth Nonconvex Optimization with Stochastic Asynchronous PALM

Published: 2016/06/04, Updated: 2016/06/07

Damek Davis

Brent Edmunds

Madeleine Udell

Nonlinear Optimization, Nonsmooth Optimization asynchronous algorithm, block-coordinate descent, matrix factorization, nonconvex, nonsmooth, stochastic algorithm

We introduce the Stochastic Asynchronous Proximal Alternating Linearized Minimization (SAPALM) method, a block coordinate stochastic proximal-gradient method for solving nonconvex, nonsmooth optimization problems. SAPALM is the first asynchronous parallel optimization method that provably converges on a large class of nonconvex, nonsmooth problems. We prove that SAPALM matches the best known rates of convergence, among …

The Asynchronous PALM Algorithm for Nonsmooth Nonconvex Problems

Published: 2016/03/08, Updated: 2016/04/02

Damek Davis

Nonsmooth Optimization asynchronous, coordinate descent, first-order methods, nonconvex, nonsmooth, stochastic algorithm

We introduce the Asynchronous PALM algorithm, a new extension of the Proximal Alternating Linearized Minimization (PALM) algorithm for solving nonconvex nonsmooth optimization problems. Like the PALM algorithm, each step of the Asynchronous PALM algorithm updates a single block of coordinates; but unlike the PALM algorithm, the Asynchronous PALM algorithm eliminates the need for sequential updates …

SMART: The Stochastic Monotone Aggregated Root-Finding Algorithm

Published: 2015/12/10, Updated: 2015/12/29

Damek Davis

Convex and Nonsmooth Optimization aggregated gradient, asynchronous updates, coordinate updates, operator splitting, stochastic algorithm, variance reduction

We introduce the Stochastic Monotone Aggregated Root-Finding (SMART) algorithm, a new randomized operator-splitting scheme for finding roots of finite sums of operators. These algorithms are similar to the growing class of incremental aggregated gradient algorithms, which minimize finite sums of functions; the difference is that we replace gradients of functions with black-boxes called operators, which …

An $O(n\log(n))$ Algorithm for Projecting Onto the Ordered Weighted $\ell_1$ Norm Ball

Published: 2015/03/31, Updated: 2015/06/26

Damek Davis

Convex and Nonsmooth Optimization octagonal shrinkage and clustering algorithm for regression, ordered weighted $\ell_1$ norm, projection operator, proximal operator

The ordered weighted $\ell_1$ (OWL) norm is a newly developed generalization of the Octagonal Shrinkage and Clustering Algorithm for Regression (OSCAR) norm. This norm has desirable statistical properties and can be used to perform simultaneous clustering and regression. In this paper, we show how to compute the projection of an $n$-dimensional vector onto the OWL …
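As background for this entry, the sketch below evaluates the OWL norm itself, $\Omega_w(x) = \sum_i w_i\,|x|_{(i)}$, where $|x|_{(1)} \ge |x|_{(2)} \ge \dots$ are the magnitudes sorted in decreasing order and the weights satisfy $w_1 \ge w_2 \ge \dots \ge w_n \ge 0$. This is only the norm evaluation, not the projection algorithm of the paper; the function names are hypothetical.

```python
def owl_norm(x, w):
    """Evaluate the ordered weighted l1 norm: sum_i w[i] * (i-th largest |x_j|)."""
    if len(x) != len(w):
        raise ValueError("x and w must have the same length")
    if any(w[i] < w[i + 1] for i in range(len(w) - 1)) or any(w_i < 0 for w_i in w):
        raise ValueError("weights must be nonnegative and nonincreasing")
    magnitudes = sorted((abs(x_j) for x_j in x), reverse=True)  # O(n log n) sort
    return sum(w_i * m_i for w_i, m_i in zip(w, magnitudes))


def oscar_weights(n, lam1, lam2):
    """OSCAR as a special case of OWL: w_i = lam1 + lam2 * (n - i), i = 1..n."""
    return [lam1 + lam2 * (n - i) for i in range(1, n + 1)]


value = owl_norm([1.0, -3.0, 2.0], [3.0, 2.0, 1.0])  # 3*3 + 2*2 + 1*1 = 14.0
l1_value = owl_norm([1.0, -3.0, 2.0], [1.0, 1.0, 1.0])  # equal weights give the l1 norm
```

With equal weights the OWL norm reduces to a multiple of the $\ell_1$ norm, and with $w = (1, 0, \dots, 0)$ it reduces to the $\ell_\infty$ norm.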

A Three-Operator Splitting Scheme and its Optimization Applications

Published: 2015/03/04, Updated: 2015/04/04

Damek Davis

Wotao Yin

Convex and Nonsmooth Optimization

Operator splitting schemes have been successfully used in computational sciences to reduce complex problems into a series of simpler subproblems. Since the 1950s, these schemes have been widely used to solve problems in PDE and control. Recently, large-scale optimization problems in machine learning, signal processing, and imaging have created a resurgence of interest in operator-splitting based …
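The abstract excerpt above is cut off before it describes the scheme. As a hedged illustration, the sketch below uses the three-operator iteration as it is commonly stated for minimizing $f(x) + g(x) + h(x)$ with $h$ smooth: $x_g = \operatorname{prox}_{\gamma g}(z)$, $x_f = \operatorname{prox}_{\gamma f}(2x_g - z - \gamma\nabla h(x_g))$, $z \leftarrow z + \lambda(x_f - x_g)$. The toy problem (project a point $c$ onto the box $[0,1]^n$, with the box split into two half-space constraints), the step size, and all names are assumptions for illustration.

```python
def three_operator_splitting(c, gamma=1.0, relax=1.0, iters=50):
    """Minimize f + g + h with
         f = indicator of {x >= 0},  g = indicator of {x <= 1},
         h(x) = 0.5 * ||x - c||^2   (gradient x - c, Lipschitz constant 1).
    gamma = 1 lies inside the usual step-size range (0, 2 / L) for L = 1.
    """
    z = [0.0] * len(c)
    for _ in range(iters):
        # prox of g: projection onto {x <= 1}
        x_g = [min(z_j, 1.0) for z_j in z]
        # prox of f: projection onto {x >= 0}, applied to the reflected gradient step
        x_f = [
            max(2.0 * xg_j - z_j - gamma * (xg_j - c_j), 0.0)
            for xg_j, z_j, c_j in zip(x_g, z, c)
        ]
        # relaxed update of the auxiliary variable z
        z = [z_j + relax * (xf_j - xg_j) for z_j, xf_j, xg_j in zip(z, x_f, x_g)]
    return [min(z_j, 1.0) for z_j in z]


solution = three_operator_splitting([2.0, -1.0, 0.5])
# The exact answer is c clipped to [0, 1], that is [1.0, 0.0, 0.5].
```

Each step touches $f$ and $g$ only through their proximal maps and $h$ only through its gradient, which is the sense in which a complex problem is reduced to simpler subproblems.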