Using Taylor-Approximated Gradients to Improve the Frank-Wolfe Method for Empirical Risk Minimization

The Frank-Wolfe method has become increasingly useful in statistical and machine learning applications, due to the structure-inducing properties of the iterates, and especially in settings where linear minimization over the feasible set is more computationally efficient than projection. In the setting of Empirical Risk Minimization — one of the fundamental optimization problems in statistical and …
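
As a point of reference, here is a minimal sketch of the classical Frank-Wolfe iteration for a least-squares ERM problem over an l1-ball, where the linear minimization oracle reduces to picking a signed vertex and no projection is ever needed. The loss, feasible set, and step-size rule are illustrative assumptions; the Taylor-approximated gradient variant of the paper is not shown.

```python
import numpy as np

def frank_wolfe_l1(X, y, radius=1.0, T=200):
    """Classical Frank-Wolfe for f(w) = (1/(2n))||Xw - y||^2 over {w : ||w||_1 <= radius}."""
    n, d = X.shape
    w = np.zeros(d)
    for t in range(T):
        grad = X.T @ (X @ w - y) / n            # exact ERM gradient
        i = np.argmax(np.abs(grad))             # LMO: best vertex of the l1-ball
        s = np.zeros(d)
        s[i] = -radius * np.sign(grad[i])
        gamma = 2.0 / (t + 2.0)                 # standard open-loop step size
        w = (1 - gamma) * w + gamma * s         # convex combination keeps iterates feasible
    return w
```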

Accelerated Stochastic Peaceman-Rachford Method for Empirical Risk Minimization

This work studies an Accelerated Stochastic Peaceman-Rachford Splitting Method (AS-PRSM) for solving a family of structured empirical risk minimization problems. The objective function to be optimized is the sum of a possibly nonsmooth convex function and a finite sum of smooth convex component functions. The smooth subproblem in AS-PRSM is solved by a stochastic gradient method using variance reduction …
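
A generic sketch of the variance-reduction idea mentioned for the smooth subproblem: an SVRG-style estimator built from a snapshot point and its full gradient. The logistic component, function names, and sampling scheme are assumptions for illustration; this is not the AS-PRSM scheme itself.

```python
import numpy as np

def component_grad(w, X, y, i):
    """Gradient of the i-th logistic-loss component (illustrative choice of f_i)."""
    xi, yi = X[i], y[i]
    return -yi * xi / (1.0 + np.exp(yi * (xi @ w)))

def vr_gradient(w, w_snap, full_grad_snap, X, y, rng):
    """SVRG-style variance-reduced estimator of the finite-sum gradient.

    full_grad_snap is the full gradient at the snapshot w_snap. The estimator
    is unbiased, and its variance shrinks as w approaches w_snap, which is what
    makes it attractive inside the smooth subproblem of a splitting method.
    """
    i = rng.integers(len(y))
    return component_grad(w, X, y, i) - component_grad(w_snap, X, y, i) + full_grad_snap
```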

A Distributed Quasi-Newton Algorithm for Empirical Risk Minimization with Nonsmooth Regularization

We propose a communication- and computation-efficient distributed optimization algorithm using second-order information for solving ERM problems with a nonsmooth regularization term. Current second-order and quasi-Newton methods for this problem either do not work well in the distributed setting or work only for specific regularizers. Our algorithm uses successive quadratic approximations, and we describe how to …
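
For orientation, a single-machine sketch of one successive-quadratic-approximation step for an l1-regularized problem: the quadratic model built from a (quasi-Newton) Hessian approximation H is minimized inexactly by proximal gradient. The distributed communication pattern that is the point of the paper is not shown, and all names below are illustrative.

```python
import numpy as np

def soft_threshold(z, t):
    return np.sign(z) * np.maximum(np.abs(z) - t, 0.0)

def sqa_step(w, grad, H, lam, inner_iters=50):
    """One step for f(w) + lam*||w||_1: minimize the model
    grad^T d + 0.5 d^T H d + lam*||w + d||_1 over d by proximal gradient.
    H is assumed symmetric positive definite (e.g. an L-BFGS approximation)."""
    d = np.zeros_like(w)
    L = np.linalg.norm(H, 2)                 # Lipschitz constant of the model's smooth part
    for _ in range(inner_iters):
        model_grad = grad + H @ d
        d = soft_threshold(w + d - model_grad / L, lam / L) - w
    return w + d
```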

DSCOVR: Randomized Primal-Dual Block Coordinate Algorithms for Asynchronous Distributed Optimization

Machine learning with big data often involves large optimization models. For distributed optimization over a cluster of machines, frequent communication and synchronization of all model parameters (optimization variables) can be very costly. A promising solution is to use parameter servers to store different subsets of the model parameters, and update them asynchronously at different machines …
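
A serial toy sketch of the parameter-partitioning idea: the model is split into blocks, one per server, and each update touches a single random data point and a single random block, so a worker only needs to talk to one server at a time. This mimics the communication pattern only; it is not the DSCOVR primal-dual algorithm, and all names are illustrative.

```python
import numpy as np

def randomized_block_updates(X, y, num_servers=4, step=0.1, iters=1000, seed=0):
    """Block-wise stochastic gradient updates for least squares, with the
    parameters partitioned into num_servers blocks (one block per 'server')."""
    rng = np.random.default_rng(seed)
    n, d = X.shape
    w = np.zeros(d)
    blocks = np.array_split(np.arange(d), num_servers)   # one block per server
    for _ in range(iters):
        i = rng.integers(n)                     # random data point (a worker's sample)
        b = blocks[rng.integers(num_servers)]   # random parameter block (one server)
        resid = X[i] @ w - y[i]
        w[b] -= step * resid * X[i, b]          # update only that block
    return w
```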

On the convergence of stochastic bi-level gradient methods

We analyze the convergence of stochastic gradient methods for bi-level optimization problems. We address two specific cases: first, when the outer objective function can be expressed as a finite sum of independent terms, and second, when both the outer and inner objective functions can be expressed as finite sums of independent terms. We assume Lipschitz …
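
In symbols, the setting described is the bi-level finite-sum problem below; the notation and the standard hypergradient formula (valid under suitable smoothness and a strongly convex inner problem) are added here for orientation and are not quoted from the paper.

```latex
% Bi-level finite-sum formulation (outer and inner objectives are both finite sums)
\min_{x}\; F(x) = \frac{1}{n}\sum_{i=1}^{n} f_i\bigl(x, y^{*}(x)\bigr),
\qquad
y^{*}(x) = \arg\min_{y}\; G(x,y), \quad G(x,y) = \frac{1}{m}\sum_{j=1}^{m} g_j(x, y)

% Standard hypergradient under a smooth, strongly convex inner problem
\nabla F(x) = \nabla_x f\bigl(x, y^{*}(x)\bigr)
  - \nabla^2_{xy} G\bigl(x, y^{*}(x)\bigr)\,
    \bigl[\nabla^2_{yy} G\bigl(x, y^{*}(x)\bigr)\bigr]^{-1}
    \nabla_y f\bigl(x, y^{*}(x)\bigr),
\qquad f(x,y) = \frac{1}{n}\sum_{i=1}^{n} f_i(x, y).
```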

Communication-Efficient Distributed Optimization of Self-Concordant Empirical Loss

We consider distributed convex optimization problems originating from sample average approximation of stochastic optimization, or empirical risk minimization in machine learning. We assume that each machine in the distributed computing system has access to a local empirical loss function, constructed with i.i.d. data sampled from a common distribution. We propose a communication-efficient distributed algorithm to …
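
A minimal sketch of the local objective each machine would hold in this setting, using a regularized logistic loss on that machine's i.i.d. shard; the global objective is the average of these local losses, and only gradients (or Hessian-vector products, for second-order schemes) need to cross the network. The loss choice and names are illustrative, not the paper's algorithm.

```python
import numpy as np

def local_loss_and_grad(w, X_k, y_k, reg=1e-3):
    """Regularized logistic loss and gradient on machine k's data shard (X_k, y_k).

    The global objective is the average of these local losses over all machines;
    the l2 term keeps the problem strongly convex. Illustrative sketch only.
    """
    z = y_k * (X_k @ w)                              # margins on the local shard
    loss = np.mean(np.log1p(np.exp(-z))) + 0.5 * reg * (w @ w)
    grad = X_k.T @ (-y_k / (1.0 + np.exp(z))) / len(y_k) + reg * w
    return loss, grad
```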

Stochastic Primal-Dual Coordinate Method for Regularized Empirical Risk Minimization

We consider a generic convex optimization problem associated with regularized empirical risk minimization of linear predictors. The problem structure allows us to reformulate it as a convex-concave saddle point problem. We propose a stochastic primal-dual coordinate (SPDC) method, which alternates between maximizing over a randomly chosen dual variable and minimizing over the primal variable. An …
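
The reformulation referred to is the standard conjugate-based saddle point form for linear predictors, written out below for orientation; here phi_i^* denotes the convex conjugate of the i-th loss phi_i and a_i the i-th data vector (notation assumed, not quoted from the paper).

```latex
\min_{w}\; \frac{1}{n}\sum_{i=1}^{n}\phi_i\bigl(a_i^{\top}w\bigr) + g(w)
\;\;\Longleftrightarrow\;\;
\min_{w}\,\max_{\alpha\in\mathbb{R}^{n}}\;
\frac{1}{n}\sum_{i=1}^{n}\bigl(\alpha_i\, a_i^{\top}w - \phi_i^{*}(\alpha_i)\bigr) + g(w).
```

A stochastic primal-dual method of this kind then updates one randomly chosen dual coordinate alpha_i and the primal variable w at each iteration, which is the alternation described above.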

An Accelerated Proximal Coordinate Gradient Method and its Application to Regularized Empirical Risk Minimization

We consider the problem of minimizing the sum of two convex functions: one is smooth and given by a gradient oracle, and the other is separable over blocks of coordinates and has a simple known structure over each block. We develop an accelerated randomized proximal coordinate gradient (APCG) method for minimizing such convex composite functions. …
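
For context, a sketch of the basic (non-accelerated) randomized proximal coordinate gradient method for the composite problem with an l1 term, which is separable over coordinates; the extrapolation step that gives APCG its acceleration is not shown, and grad_f, L, and the regularizer are illustrative assumptions.

```python
import numpy as np

def rand_prox_coordinate_descent(grad_f, L, lam, d, iters=5000, seed=0):
    """Randomized proximal coordinate gradient for min_x f(x) + lam*||x||_1.

    grad_f(x, i) returns the i-th partial derivative of the smooth part f,
    and L[i] is its coordinate-wise Lipschitz constant. Each step updates one
    random coordinate with a gradient step followed by soft-thresholding
    (the proximal map of the separable l1 term).
    """
    rng = np.random.default_rng(seed)
    x = np.zeros(d)
    for _ in range(iters):
        i = rng.integers(d)
        u = x[i] - grad_f(x, i) / L[i]                        # coordinate gradient step
        x[i] = np.sign(u) * max(abs(u) - lam / L[i], 0.0)     # soft-threshold
    return x
```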