accelerated gradient method – Optimization Online

Accelerated Bregman Proximal Gradient Methods for Relatively Smooth Convex Optimization

Published: 2018/08/09

Convex Optimization, Nonlinear Optimization accelerated gradient method, bregman distance, convex optimization, proximal gradient method, relatively smooth

We consider the problem of minimizing the sum of two convex functions: one is differentiable and relatively smooth with respect to a reference convex function, and the other can be nondifferentiable but simple to optimize. The relatively smooth condition is much weaker than the standard assumption of uniform Lipschitz continuity of the gradients, thus significantly … Read more

An optimal first order method based on optimal quadratic averaging

Published: 2016/04/22, Updated: 2016/04/25

Dmitriy Drusvyatskiy

Maryam Fazel

Scott Roy

Convex Optimization accelerated gradient method, convex quadratic programming, first-order methods

In a recent paper, Bubeck, Lee, and Singh introduced a new first order method for minimizing smooth strongly convex functions. Their geometric descent algorithm, largely inspired by the ellipsoid method, enjoys the optimal linear rate of convergence. Motivated by their work, we propose a close variant that iteratively maintains a quadratic global under-estimator of the … Read more

An accelerated non-Euclidean hybrid proximal extragradient-type Algorithm for convex-concave saddle-point Problems

Published: 2015/09/18

Oliver Kolossoski

Renato D.C. Monteiro

Convex Optimization accelerated gradient method, bregman distances, complexity, convex optimization, ergodic convergence, hybrid proximal extragradient method, inexact proximal method, maximal monotone operator, saddle point problem

This paper describes an accelerated HPE-type method based on general Bregman distances for solving monotone saddle-point (SP) problems. The algorithm is a special instance of a non-Euclidean hybrid proximal extragradient framework introduced by Svaiter and Solodov [28] where the prox sub-inclusions are solved using an accelerated gradient method. It generalizes the accelerated HPE algorithm presented … Read more

An Accelerated Proximal Coordinate Gradient Method and its Application to Regularized Empirical Risk Minimization

Published: 2014/07/07

Qihang Lin

Zhaosong Lu

Lin Xiao

Applications - Science and Engineering, Convex and Nonsmooth Optimization, Stochastic Programming accelerated gradient method, coordinate gradient method, empirical risk minimization

We consider the problem of minimizing the sum of two convex functions: one is smooth and given by a gradient oracle, and the other is separable over blocks of coordinates and has a simple known structure over each block. We develop an accelerated randomized proximal coordinate gradient (APCG) method for minimizing such convex composite functions. … Read more

An adaptive accelerated proximal gradient method and its homotopy continuation for sparse optimization

Published: 2013/04/05

Qihang Lin

Lin Xiao

Convex and Nonsmooth Optimization accelerated gradient method, homotopy continuation, restricted eigenvalue conditions, sparse optimization

We consider optimization problems with an objective function that is the sum of two convex terms: one is smooth and given by a black-box oracle, and the other is general but with a simple, known structure. We first present an accelerated proximal gradient (APG) method for problems where the smooth part of the objective function … Read more

Dual Averaging Methods for Regularized Stochastic Learning and Online Optimization

Published: 2010/04/15, Updated: 2010/11/15

Lin Xiao

Convex and Nonsmooth Optimization, Stochastic Programming accelerated gradient method, dual averaging method, online optimization, sparse regularization, stochastic learning

We consider regularized stochastic learning and online optimization problems, where the objective function is the sum of two convex terms: one is the loss function of the learning task, and the other is a simple regularization term such as $\ell_1$-norm for promoting sparsity. We develop extensions of Nesterov’s dual averaging method, that can exploit the … Read more