## Global Convergence in Deep Learning with Variable Splitting via the Kurdyka-Łojasiewicz Property

Deep learning has recently attracted significant attention due to its great empirical success. However, why training deep neural networks (DNNs) is effective remains a mystery, since the associated optimization problems are nonconvex. In this paper, we aim to provide some theoretical understanding of such optimization problems. In particular, the Kurdyka-Łojasiewicz (KL) property is established … Read more
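
For context, a standard statement of the KL property (the definition is classical; the notation here is generic and not taken from the truncated abstract): a proper lower semicontinuous function $f$ satisfies the KL property at $\bar{x}$ if there exist $\eta>0$, a neighborhood $U$ of $\bar{x}$, and a continuous concave function $\varphi:[0,\eta)\to[0,\infty)$ with $\varphi(0)=0$, $\varphi$ continuously differentiable on $(0,\eta)$, and $\varphi'>0$, such that

$$\varphi'\bigl(f(x)-f(\bar{x})\bigr)\,\mathrm{dist}\bigl(0,\partial f(x)\bigr)\;\ge\;1 \qquad\text{for all } x\in U \text{ with } f(\bar{x})<f(x)<f(\bar{x})+\eta.$$

Descent schemes whose iterates satisfy sufficient-decrease and relative-error conditions on a KL function converge to a single critical point rather than merely having convergent subsequences, which is the usual route to the kind of global convergence result announced in the title.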

## Let’s Make Block Coordinate Descent Go Fast: Faster Greedy Rules, Message-Passing, Active-Set Complexity, and Superlinear Convergence

Block coordinate descent (BCD) methods are widely used for large-scale numerical optimization because of their cheap iteration costs, low memory requirements, amenability to parallelization, and ability to exploit problem structure. Three main algorithmic choices influence the performance of BCD methods: the block partitioning strategy, the block selection rule, and the block update rule. In this paper … Read more
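
As a toy illustration of the block selection question (not the paper's rules or experiments; the problem data and block partition below are made up), the sketch runs block coordinate descent on a convex quadratic $f(x)=\tfrac12 x^\top A x - b^\top x$ with either cyclic selection or a greedy, Gauss-Southwell-style rule that picks the block with the largest partial gradient, and uses an exact block update.

```python
import numpy as np

def block_coordinate_descent(A, b, blocks, n_iters=200, rule="greedy"):
    """Minimize f(x) = 0.5 x^T A x - b^T x (A symmetric positive definite)
    by exact minimization over one block of coordinates per iteration."""
    x = np.zeros(len(b))
    for k in range(n_iters):
        grad = A @ x - b
        if rule == "greedy":
            # Gauss-Southwell-style rule: pick the block with the largest partial gradient norm.
            i = max(range(len(blocks)), key=lambda j: np.linalg.norm(grad[blocks[j]]))
        else:
            # Cyclic rule: sweep through the blocks in order.
            i = k % len(blocks)
        idx = blocks[i]
        # Exact block update: solve the block subproblem A[idx, idx] d = -grad[idx].
        x[idx] += np.linalg.solve(A[np.ix_(idx, idx)], -grad[idx])
    return x

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    M = rng.standard_normal((60, 40))
    A = M.T @ M + 0.1 * np.eye(40)             # symmetric positive definite
    b = rng.standard_normal(40)
    blocks = np.array_split(np.arange(40), 8)  # 8 blocks of 5 coordinates
    x_greedy = block_coordinate_descent(A, b, blocks, rule="greedy")
    x_star = np.linalg.solve(A, b)
    print("distance to optimum:", np.linalg.norm(x_greedy - x_star))
```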

## Global Convergence of ADMM in Nonconvex Nonsmooth Optimization

In this paper, we analyze the convergence of the alternating direction method of multipliers (ADMM) for minimizing a nonconvex and possibly nonsmooth objective function, $\phi(x_0,\ldots,x_p,y)$, subject to coupled linear equality constraints. Our ADMM updates each of the primal variables $x_0,\ldots,x_p,y$, followed by updating the dual variable. We separate the variable $y$ from the $x_i$'s as it … Read more
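
As background only (the setting above is nonconvex and multi-block; the instance below is convex, two-block, and entirely illustrative), here is a minimal ADMM in scaled dual form for a lasso problem split as $\min_{x,z} \tfrac12\|Mx-d\|^2+\lambda\|z\|_1$ subject to $x-z=0$; the variable names and penalty $\rho$ are choices made here, not the paper's.

```python
import numpy as np

def soft_threshold(v, t):
    """Proximal operator of t * ||.||_1."""
    return np.sign(v) * np.maximum(np.abs(v) - t, 0.0)

def admm_lasso(M, d, lam, rho=1.0, n_iters=200):
    """Two-block ADMM (scaled dual form) for
    min 0.5||Mx - d||^2 + lam*||z||_1  subject to  x - z = 0."""
    n = M.shape[1]
    z = np.zeros(n)
    u = np.zeros(n)                  # scaled dual variable
    # Factor data for the x-subproblem once; it does not change across iterations.
    Q = M.T @ M + rho * np.eye(n)
    Mtd = M.T @ d
    for _ in range(n_iters):
        x = np.linalg.solve(Q, Mtd + rho * (z - u))   # x-update (ridge-type solve)
        z = soft_threshold(x + u, lam / rho)          # z-update (prox of the l1 term)
        u = u + x - z                                 # dual update
    return x, z

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    M = rng.standard_normal((100, 30))
    d = rng.standard_normal(100)
    x, z = admm_lasso(M, d, lam=0.5)
    print("constraint residual ||x - z||:", np.linalg.norm(x - z))
```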

## A New First-order Algorithmic Framework for Optimization Problems with Orthogonality Constraints

In this paper, we consider a class of optimization problems with orthogonality constraints, the feasible region of which is called the Stiefel manifold. Our new framework combines a function value reduction step with a correction step. Unlike existing approaches, the function value reduction step of our algorithmic framework searches along the standard Euclidean … Read more
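
Since the abstract is truncated, the sketch below is not the paper's framework; it only illustrates the two ingredients mentioned (a function value reduction step along a Euclidean direction, followed by a step restoring feasibility) via a plain gradient step plus a projection back onto the Stiefel manifold, on the toy problem $\min_{X^\top X=I}\operatorname{tr}(X^\top A X)$, whose minimizers span eigenvectors of the smallest eigenvalues of $A$.

```python
import numpy as np

def project_stiefel(Y):
    """Nearest matrix with orthonormal columns (Frobenius norm), i.e. the polar
    factor U V^T of the thin SVD Y = U S V^T."""
    U, _, Vt = np.linalg.svd(Y, full_matrices=False)
    return U @ Vt

def projected_gradient_stiefel(A, p, n_iters=500, seed=0):
    """Toy projected-gradient method for min tr(X^T A X) s.t. X^T X = I:
    a Euclidean gradient step followed by a correction back to feasibility."""
    rng = np.random.default_rng(seed)
    n = A.shape[0]
    X = project_stiefel(rng.standard_normal((n, p)))   # feasible starting point
    step = 1.0 / (2.0 * np.linalg.norm(A, 2))          # 1 / Lipschitz constant of the gradient
    for _ in range(n_iters):
        G = 2.0 * A @ X                                 # Euclidean gradient of tr(X^T A X)
        X = project_stiefel(X - step * G)               # reduce, then restore feasibility
    return X

if __name__ == "__main__":
    rng = np.random.default_rng(1)
    B = rng.standard_normal((20, 20))
    A = (B + B.T) / 2.0                                 # symmetric test matrix
    X = projected_gradient_stiefel(A, p=3)
    print("objective:", np.trace(X.T @ A @ X))
    print("sum of 3 smallest eigenvalues:", np.sort(np.linalg.eigvalsh(A))[:3].sum())
```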

## The Sound of APALM Clapping: Faster Nonsmooth Nonconvex Optimization with Stochastic Asynchronous PALM

We introduce the Stochastic Asynchronous Proximal Alternating Linearized Minimization (SAPALM) method, a block coordinate stochastic proximal-gradient method for solving nonconvex, nonsmooth optimization problems. SAPALM is the first asynchronous parallel optimization method that provably converges on a large class of nonconvex, nonsmooth problems. We prove that SAPALM matches the best known rates of convergence — among … Read more
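
The asynchrony and stochastic gradients are SAPALM's contribution and are not reproduced here; the sketch below shows only the underlying serial PALM iteration (a proximal linearized update of one block at a time, with step sizes tied to blockwise Lipschitz constants), applied to a toy nonnegative matrix factorization $\min_{X,Y\ge 0}\tfrac12\|M-XY\|_F^2$. The instance and constants are illustrative.

```python
import numpy as np

def palm_nmf(M, r, n_iters=300, seed=0):
    """Serial PALM for min_{X>=0, Y>=0} 0.5 * ||M - X Y||_F^2.
    Each block takes a gradient step on the smooth coupling term followed by
    the prox of the nonnegativity constraint (projection onto >= 0)."""
    rng = np.random.default_rng(seed)
    m, n = M.shape
    X = np.abs(rng.standard_normal((m, r)))
    Y = np.abs(rng.standard_normal((r, n)))
    eps = 1e-8
    for _ in range(n_iters):
        # X-block: Lipschitz constant of grad_X is ||Y Y^T||_2.
        Lx = np.linalg.norm(Y @ Y.T, 2) + eps
        grad_X = (X @ Y - M) @ Y.T
        X = np.maximum(X - grad_X / Lx, 0.0)
        # Y-block: Lipschitz constant of grad_Y is ||X^T X||_2.
        Ly = np.linalg.norm(X.T @ X, 2) + eps
        grad_Y = X.T @ (X @ Y - M)
        Y = np.maximum(Y - grad_Y / Ly, 0.0)
    return X, Y

if __name__ == "__main__":
    rng = np.random.default_rng(1)
    M = np.abs(rng.standard_normal((40, 30)))
    X, Y = palm_nmf(M, r=5)
    print("relative fit:", np.linalg.norm(M - X @ Y) / np.linalg.norm(M))
```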

## Global Convergence of ADMM in Nonconvex Nonsmooth Optimization

In this paper, we analyze the convergence of the alternating direction method of multipliers (ADMM) for minimizing a nonconvex and possibly nonsmooth objective function, $\phi(x_1,\ldots,x_p,y)$, subject to linear equality constraints that couple $x_1,\ldots,x_p,y$, where $p\ge 1$ is an integer. Our ADMM sequentially updates the primal variables in the order $x_1,\ldots,x_p,y$, followed by updating the dual … Read more
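
Spelling out the update order described above, with constraint matrices $A_i$, $B$, right-hand side $b$, and penalty $\beta$ introduced here as generic notation (the abstract is truncated): the ADMM performs a Gauss-Seidel pass over the augmented Lagrangian

$$\mathcal{L}_\beta(x_1,\ldots,x_p,y,w)=\phi(x_1,\ldots,x_p,y)+\Big\langle w,\ \sum_{i=1}^p A_i x_i + B y - b\Big\rangle+\frac{\beta}{2}\Big\|\sum_{i=1}^p A_i x_i + B y - b\Big\|^2,$$

updating $x_i^{k+1}=\arg\min_{x_i}\mathcal{L}_\beta(x_1^{k+1},\ldots,x_{i-1}^{k+1},x_i,x_{i+1}^{k},\ldots,x_p^{k},y^{k},w^{k})$ for $i=1,\ldots,p$, then $y^{k+1}=\arg\min_{y}\mathcal{L}_\beta(x_1^{k+1},\ldots,x_p^{k+1},y,w^{k})$, and finally the dual step $w^{k+1}=w^{k}+\beta\big(\sum_{i=1}^p A_i x_i^{k+1}+By^{k+1}-b\big)$.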

## A remark on accelerated block coordinate descent for computing the proximity operators of a sum of convex functions

We analyze alternating descent algorithms for minimizing the sum of a quadratic function and block-separable nonsmooth functions. When the quadratic interactions between the blocks are pairwise, we show that the schemes can be accelerated, leading to improved convergence rates compared with related accelerated parallel proximal descent methods. As an application we obtain very … Read more
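
A minimal illustration of the base (unaccelerated) scheme only, on an objective made up for this purpose: two blocks coupled by a pairwise quadratic term plus a block-separable nonsmooth term, minimized by exact alternating descent. The paper's accelerated variants and the application to proximity operators of sums are not reproduced here.

```python
import numpy as np

def soft_threshold(v, t):
    """Proximal operator of t * ||.||_1."""
    return np.sign(v) * np.maximum(np.abs(v) - t, 0.0)

def alternating_descent(a, mu=1.0, lam=0.3, n_iters=100):
    """Exact alternating minimization of
    F(x, y) = 0.5||x - a||^2 + (mu/2)||x - y||^2 + lam*||y||_1,
    i.e. a quadratic with a pairwise block interaction plus a block-separable
    nonsmooth term."""
    x = a.copy()
    y = np.zeros_like(a)
    for _ in range(n_iters):
        # x-block: minimize the (strongly convex) quadratic part in closed form.
        x = (a + mu * y) / (1.0 + mu)
        # y-block: proximal step, i.e. soft-thresholding.
        y = soft_threshold(x, lam / mu)
    return x, y

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    a = rng.standard_normal(20)
    x, y = alternating_descent(a)
    F = 0.5 * np.sum((x - a) ** 2) + 0.5 * np.sum((x - y) ** 2) + 0.3 * np.abs(y).sum()
    print("objective value:", F)
```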

## Randomized First-order Methods for Saddle Point Optimization

In this paper, we present novel randomized algorithms for solving saddle point problems whose dual feasible region is a direct product of many convex sets. Our algorithms can achieve an ${\cal O}(1/N)$ rate of convergence by solving only one dual subproblem at each iteration. Our algorithms can also achieve an ${\cal O}(1/N^2)$ rate of convergence if a … Read more
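
For concreteness (the notation below is introduced here, since the abstract is truncated), a saddle point problem with a product-structured dual feasible region can be written as

$$\min_{x\in X}\ \max_{y=(y_1,\ldots,y_m)\in Y_1\times\cdots\times Y_m}\ f(x)+\sum_{i=1}^{m}\langle A_i x, y_i\rangle-\sum_{i=1}^{m} g_i(y_i),$$

and "solving only one dual subproblem at each iteration" then means updating a single randomly chosen dual block $y_i$ per iteration rather than all $m$ of them.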

## Block stochastic gradient iteration for convex and nonconvex optimization

The stochastic gradient (SG) method can minimize an objective function composed of a large number of differentiable functions, or solve a stochastic optimization problem, to a moderate accuracy. The block coordinate descent/update (BCD) method, on the other hand, handles problems with multiple blocks of variables by updating them one at a time; when the blocks … Read more
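
A minimal sketch of the combination (illustrative only; the step size, batch size, and block selection below are not the paper's rules): at each iteration draw a minibatch, pick one block of variables uniformly at random, and take a gradient step in that block using the stochastic partial gradient, here on the least-squares objective $\tfrac1n\sum_i\tfrac12(a_i^\top x-b_i)^2$.

```python
import numpy as np

def block_stochastic_gradient(A, b, blocks, n_iters=2000, batch_size=10,
                              step=0.05, seed=0):
    """Block stochastic gradient for f(x) = (1/n) * sum_i 0.5*(a_i^T x - b_i)^2.
    Each iteration: sample a minibatch, pick one block uniformly at random,
    take a gradient step in that block only."""
    rng = np.random.default_rng(seed)
    n, d = A.shape
    x = np.zeros(d)
    for _ in range(n_iters):
        batch = rng.choice(n, size=batch_size, replace=False)
        idx = blocks[rng.integers(len(blocks))]
        # Stochastic partial gradient restricted to the chosen block.
        residual = A[batch] @ x - b[batch]
        g_block = A[np.ix_(batch, idx)].T @ residual / batch_size
        x[idx] -= step * g_block
    return x

if __name__ == "__main__":
    rng = np.random.default_rng(1)
    A = rng.standard_normal((500, 40))
    x_true = rng.standard_normal(40)
    b = A @ x_true + 0.01 * rng.standard_normal(500)
    blocks = np.array_split(np.arange(40), 4)
    x = block_stochastic_gradient(A, b, blocks)
    print("relative error:", np.linalg.norm(x - x_true) / np.linalg.norm(x_true))
```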

## Robust Block Coordinate Descent

In this paper we present a novel randomized block coordinate descent method for the minimization of a convex composite objective function. The method uses (approximate) partial second-order (curvature) information, so that the algorithm's performance is more robust when applied to highly nonseparable or ill-conditioned problems. We call the method Robust Coordinate Descent (RCD). At … Read more
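
As a generic illustration of a block update that uses partial second-order information (this is not the RCD method itself, whose update and analysis are in the paper): pick a random block, assemble the corresponding block of the Hessian, and solve a small regularized Newton system, here for $\ell_2$-regularized logistic regression with a simple backtracking safeguard.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def f_value(A, y, x, reg):
    """Regularized logistic loss (1/n) sum log(1+exp(-y * Ax)) + (reg/2)||x||^2."""
    return np.mean(np.logaddexp(0.0, -y * (A @ x))) + 0.5 * reg * np.sum(x ** 2)

def block_newton_logistic(A, y, blocks, reg=0.1, n_iters=200, seed=0):
    """Randomized block coordinate descent with partial (block) second-order
    information: each iteration solves a small regularized Newton system for
    one randomly chosen block, with backtracking for robustness."""
    rng = np.random.default_rng(seed)
    n, d = A.shape
    x = np.zeros(d)
    for _ in range(n_iters):
        idx = blocks[rng.integers(len(blocks))]
        p = sigmoid(y * (A @ x))                       # P(correct label) per sample
        g = -A[:, idx].T @ (y * (1.0 - p)) / n + reg * x[idx]           # block gradient
        w = p * (1.0 - p)                              # per-sample curvature weights
        H = (A[:, idx].T * w) @ A[:, idx] / n + reg * np.eye(len(idx))  # block Hessian
        direction = np.linalg.solve(H, g)
        # Backtracking: shrink the block Newton step until the objective decreases.
        t, f_old = 1.0, f_value(A, y, x, reg)
        while True:
            x_new = x.copy()
            x_new[idx] = x[idx] - t * direction
            if f_value(A, y, x_new, reg) <= f_old or t < 1e-8:
                break
            t *= 0.5
        x = x_new
    return x

if __name__ == "__main__":
    rng = np.random.default_rng(1)
    A = rng.standard_normal((300, 30))
    y = np.sign(A @ rng.standard_normal(30) + 0.1 * rng.standard_normal(300))
    blocks = np.array_split(np.arange(30), 6)
    x = block_newton_logistic(A, y, blocks)
    print("final objective:", f_value(A, y, x, reg=0.1))
```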