A Random Block-Coordinate Douglas-Rachford Splitting Method with Low Computational Complexity for Binary Logistic Regression

In this paper, we propose a new optimization algorithm for sparse logistic regression based on a stochastic version of the Douglas-Rachford splitting method. Our algorithm sweeps the training set by randomly selecting a mini-batch of data at each iteration, and it allows us to update the variables in a block-coordinate manner. Our approach … Read more
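The abstract is truncated, but the underlying Douglas-Rachford iteration for a composite objective min_w loss(w) + lam*||w||_1 can be sketched as follows. This is a minimal, full-batch illustration under stated assumptions: the l1 prox is the closed-form soft-thresholding, while the prox of the logistic loss (which has no closed form) is approximated by a few inner gradient steps. The stochastic mini-batch and block-coordinate structure described in the paper is not reproduced here.

```python
import numpy as np

def soft_threshold(v, tau):
    # Proximal map of tau * ||.||_1 (closed form).
    return np.sign(v) * np.maximum(np.abs(v) - tau, 0.0)

def logistic_loss_grad(w, X, y):
    # Gradient of the averaged binary logistic loss with labels y in {-1, +1}.
    z = -y * (X @ w)
    return X.T @ (-y / (1.0 + np.exp(-z))) / len(y)

def prox_logistic(v, X, y, gamma, inner_steps=20, lr=0.1):
    # Approximate prox of gamma * logistic_loss at v by gradient descent
    # on the prox subproblem (no closed form is available).
    w = v.copy()
    for _ in range(inner_steps):
        w -= lr * (gamma * logistic_loss_grad(w, X, y) + (w - v))
    return w

def drs_sparse_logreg(X, y, lam=0.1, gamma=1.0, iters=200):
    # Plain Douglas-Rachford splitting on min_w logistic_loss(w) + lam*||w||_1.
    z = np.zeros(X.shape[1])
    for _ in range(iters):
        w = soft_threshold(z, gamma * lam)           # prox of the l1 term
        v = prox_logistic(2 * w - z, X, y, gamma)    # (approximate) prox of the loss
        z = z + v - w                                # averaging step
    return soft_threshold(z, gamma * lam)
```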

A single potential governing convergence of conjugate gradient, accelerated gradient and geometric descent

Nesterov’s accelerated gradient (AG) method for minimizing a smooth strongly convex function $f$ is known to reduce $f({\bf x}_k)-f({\bf x}^*)$ by a factor of $\epsilon\in(0,1)$ after $k=O(\sqrt{L/\ell}\log(1/\epsilon))$ iterations, where $\ell,L$ are the two parameters of smooth strong convexity. Furthermore, it is known that this is the best possible complexity in the function-gradient oracle model of … Read more
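For reference, a minimal sketch of Nesterov's accelerated gradient method for an $L$-smooth, $\ell$-strongly convex function, using the classical constant momentum coefficient that yields the $O(\sqrt{L/\ell}\log(1/\epsilon))$ complexity quoted above; the gradient oracle and the example quadratic are illustrative placeholders.

```python
import numpy as np

def nesterov_ag(grad_f, x0, L, ell, iters=500):
    # Accelerated gradient for an L-smooth, ell-strongly convex function,
    # with constant momentum (sqrt(kappa) - 1) / (sqrt(kappa) + 1).
    kappa = L / ell
    beta = (np.sqrt(kappa) - 1.0) / (np.sqrt(kappa) + 1.0)
    x, y = x0.copy(), x0.copy()
    for _ in range(iters):
        x_next = y - grad_f(y) / L          # gradient step from the extrapolated point
        y = x_next + beta * (x_next - x)    # momentum / extrapolation step
        x = x_next
    return x

# Example: minimize 0.5 * x^T A x - b^T x with condition number L/ell = 100.
A = np.diag([1.0, 10.0, 100.0])
b = np.ones(3)
sol = nesterov_ag(lambda x: A @ x - b, np.zeros(3), L=100.0, ell=1.0)
```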

Let’s Make Block Coordinate Descent Go Fast: Faster Greedy Rules, Message-Passing, Active-Set Complexity, and Superlinear Convergence

Block coordinate descent (BCD) methods are widely-used for large-scale numerical optimization because of their cheap iteration costs, low memory requirements, amenability to parallelization, and ability to exploit problem structure. Three main algorithmic choices influence the performance of BCD methods: the block partitioning strategy, the block selection rule, and the block update rule. In this paper … Read more
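As a toy illustration of two of the algorithmic choices mentioned (the block selection rule and the block update rule), the sketch below runs coordinate descent on a convex quadratic with single-coordinate blocks and exact coordinate minimization, comparing a uniform random rule with the greedy Gauss-Southwell rule. This is an assumption-laden toy, not the paper's algorithm.

```python
import numpy as np

def bcd_quadratic(A, b, iters=200, rule="greedy"):
    # Coordinate descent on f(x) = 0.5 * x^T A x - b^T x
    # (A assumed symmetric positive definite).
    n = len(b)
    x = np.zeros(n)
    for _ in range(iters):
        grad = A @ x - b
        if rule == "greedy":
            i = int(np.argmax(np.abs(grad)))   # Gauss-Southwell: largest gradient entry
        else:
            i = np.random.randint(n)           # uniform random block selection
        x[i] -= grad[i] / A[i, i]              # exact minimization along coordinate i
    return x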

An Algorithm for Piecewise Linear Optimization of Objective Functions in Abs-normal Form

In the paper [11] we derived first-order (KKT) and second-order (SSC) optimality conditions for functions defined by evaluation programs involving smooth elementals and absolute values. For this class of problems we showed in [12] that the natural algorithm of successive piecewise linear optimization with a proximal term (SPLOP) achieves a linear or even … Read more

Long-Step Path-Following Algorithm for Solving Symmetric Programming Problems with Nonlinear Objective Functions

We describe a long-step path-following algorithm for a class of symmetric programming problems with nonlinear convex objective functions. Complexity estimates similar to those for the case of a linear-quadratic objective function are established. The results of numerical experiments for the class of optimization problems involving quantum entropy are presented. Citation: Preprint, University of Notre Dame, December … Read more

Convergence Rates for Deterministic and Stochastic Subgradient Methods Without Lipschitz Continuity

We generalize the classic convergence rate theory for subgradient methods to apply to non-Lipschitz functions via a new measure of steepness. For the deterministic projected subgradient method, we derive a global $O(1/\sqrt{T})$ convergence rate for any function with at most exponential growth. Our approach implies generalizations of the standard convergence rates for gradient descent on … Read more
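A minimal sketch of the deterministic projected subgradient method with the classical diminishing step sizes behind $O(1/\sqrt{T})$ rates, minimizing a convex (possibly nonsmooth) function over a Euclidean ball; the subgradient oracle, step-size constant, and radius are illustrative assumptions, and the averaged iterate is returned since the rate is usually stated for it.

```python
import numpy as np

def projected_subgradient(subgrad, x0, radius, T=1000, c=1.0):
    # Projected subgradient method on the Euclidean ball of the given radius,
    # with diminishing steps c / sqrt(t + 1); returns the averaged iterate.
    x = x0.copy()
    avg = np.zeros_like(x0)
    for t in range(T):
        x = x - (c / np.sqrt(t + 1)) * subgrad(x)
        norm = np.linalg.norm(x)
        if norm > radius:                  # projection onto the ball
            x *= radius / norm
        avg += x
    return avg / T

# Example: minimize ||x||_1 over the ball of radius 2 (a subgradient is sign(x)).
sol = projected_subgradient(np.sign, np.array([1.5, -1.0]), radius=2.0)
```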

"Active-set complexity" of proximal gradient: How long does it take to find the sparsity pattern?

Proximal gradient methods have been found to be highly effective for solving minimization problems with non-negative constraints or L1-regularization. Under suitable nondegeneracy conditions, it is known that these algorithms identify the optimal sparsity pattern for these types of problems in a finite number of iterations. However, it is not known how many iterations this may … Read more
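For intuition about "identifying the sparsity pattern," the sketch below runs the proximal gradient (ISTA) iteration on an L1-regularized least-squares toy and records the last iteration at which the support (set of nonzero coordinates) changed; the problem setup and constants are illustrative assumptions, not the paper's analysis.

```python
import numpy as np

def soft_threshold(v, tau):
    # Proximal map of tau * ||.||_1.
    return np.sign(v) * np.maximum(np.abs(v) - tau, 0.0)

def ista_support_identification(A, b, lam, iters=500):
    # Proximal gradient (ISTA) on 0.5*||Ax - b||^2 + lam*||x||_1,
    # tracking when the sparsity pattern of the iterates stops changing.
    L = np.linalg.norm(A, 2) ** 2            # Lipschitz constant of the gradient
    x = np.zeros(A.shape[1])
    support = np.zeros(A.shape[1], dtype=bool)
    last_change = None
    for k in range(iters):
        grad = A.T @ (A @ x - b)
        x = soft_threshold(x - grad / L, lam / L)
        new_support = x != 0
        if not np.array_equal(new_support, support):
            last_change = k                  # support changed at iteration k
            support = new_support
    return x, last_change
```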

Two-level value function approach to nonsmooth optimistic and pessimistic bilevel programs

The authors’ paper in Ref. [5] was the first to provide detailed optimality conditions for pessimistic bilevel optimization. The results there were based on the concept of the two-level optimal value function introduced and analyzed in Ref. [4] for the case of optimistic bilevel programs. One of the basic assumptions in both of these … Read more

Iteration complexity of an inexact Douglas-Rachford method and of a Douglas-Rachford-Tseng’s F-B four-operator splitting method for solving monotone inclusions

In this paper, we propose and study the iteration complexity of an inexact Douglas-Rachford splitting (DRS) method and a Douglas-Rachford-Tseng’s forward-backward (F-B) splitting method for solving two-operator and four-operator monotone inclusions, respectively. The former method (although based on a slightly different mechanism of iteration) is motivated by the recent work of J. Eckstein and W. … Read more

Linear Convergence Rate of the Generalized Alternating Direction Method of Multipliers for a Class of Convex Optimization Problems

Recently, the generalized alternating direction method of multipliers (GADMM) proposed by Eckstein and Bertsekas has received intensive attention from a broad spectrum of areas. In this paper, we consider the convergence rate of GADMM when applied to convex optimization problems in which the subdifferentials of the underlying functions are piecewise linear multifunctions, including LASSO, a … Read more
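As a concrete instance, here is a hedged sketch of ADMM with an over-relaxation factor alpha (the Eckstein-Bertsekas "generalized" variant) applied to LASSO, min_x 0.5*||Ax - b||^2 + lam*||z||_1 subject to x = z; the penalty rho and relaxation alpha are illustrative choices, not values from the paper.

```python
import numpy as np

def gadmm_lasso(A, b, lam, rho=1.0, alpha=1.5, iters=300):
    # Generalized (over-relaxed) ADMM for LASSO; alpha in (0, 2),
    # with alpha = 1 recovering plain ADMM.
    m, n = A.shape
    x = np.zeros(n); z = np.zeros(n); u = np.zeros(n)
    AtA = A.T @ A + rho * np.eye(n)
    Atb = A.T @ b
    for _ in range(iters):
        x = np.linalg.solve(AtA, Atb + rho * (z - u))       # x-update (ridge solve)
        x_hat = alpha * x + (1.0 - alpha) * z               # over-relaxation step
        z = np.sign(x_hat + u) * np.maximum(np.abs(x_hat + u) - lam / rho, 0.0)  # soft-threshold
        u = u + x_hat - z                                    # dual (multiplier) update
    return z
```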