December 2017 – Optimization Online

Smart “Predict, then Optimize”

Published: 2017/12/31, Updated: 2019/07/24

Applications - OR and Management Sciences, Data-Mining, Statistics

Many real-world analytics problems involve two significant challenges: prediction and optimization. Due to the typically complex nature of each challenge, the standard paradigm is to predict, then optimize. By and large, machine learning tools are intended to minimize prediction error and do not account for how the predictions will be used in a downstream optimization … Read more

A Random Block-Coordinate Douglas-Rachford Splitting Method with Low Computational Complexity for Binary Logistic Regression

Published: 2017/12/31

Luis M. Briceno-Arias

Émilie Chouzenoux

Jean-Christophe Pesquet

Giovanni Chierchia

Convex Optimization, Statistics

block-coordinate methods, machine learning, proximal algorithms, stochastic methods

In this paper, we propose a new optimization algorithm for sparse logistic regression based on a stochastic version of the Douglas-Rachford splitting method. Our algorithm sweeps the training set by randomly selecting a mini-batch of data at each iteration, and it allows us to update the variables in a block-coordinate manner. Our approach … Read more

GEP-MSCRA for computing the group zero-norm regularized least squares estimator

Published: 2017/12/31

Shaohua Pan

Shujun Bi

Nonsmooth Optimization, Statistics

This paper is concerned with the group zero-norm regularized least squares estimator, which, in terms of the variational characterization of the zero-norm, can be obtained from a mathematical program with equilibrium constraints (MPEC). By developing the global exact penalty for the MPEC, this estimator is shown to arise from an exact penalization problem that not only … Read more

A single potential governing convergence of conjugate gradient, accelerated gradient and geometric descent

Published: 2017/12/30

Sahar Karimi

Stephen A. Vavasis

Convex Optimization

accelerated gradient, conjugate gradient method, geometric descent, strongly convex

Nesterov’s accelerated gradient (AG) method for minimizing a smooth strongly convex function $f$ is known to reduce $f({\bf x}_k)-f({\bf x}^*)$ by a factor of $\epsilon\in(0,1)$ after $k=O(\sqrt{L/\ell}\log(1/\epsilon))$ iterations, where $\ell,L$ are the two parameters of smooth strong convexity. Furthermore, it is known that this is the best possible complexity in the function-gradient oracle model of … Read more
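As a rough illustration of the linear rate quoted in this abstract, the following minimal sketch runs the constant-momentum variant of accelerated gradient on a small diagonal quadratic. The test function, the names `accelerated_gradient`, `ELL`, and `BIG_L`, and the iteration count are assumptions chosen for illustration; none of them come from the paper.

```python
import math

def accelerated_gradient(grad, x0, ell, L, iters):
    """Constant-momentum accelerated gradient for an ell-strongly convex, L-smooth f."""
    kappa = L / ell  # condition number
    beta = (math.sqrt(kappa) - 1.0) / (math.sqrt(kappa) + 1.0)  # momentum weight
    x = list(x0)
    y = list(x0)
    for _ in range(iters):
        g = grad(y)
        # Gradient step of length 1/L taken from the extrapolated point y.
        x_next = [yi - gi / L for yi, gi in zip(y, g)]
        # Extrapolate along the most recent displacement.
        y = [xn + beta * (xn - xi) for xn, xi in zip(x_next, x)]
        x = x_next
    return x

# Hypothetical test problem: f(x) = 0.5 * (ELL * x1^2 + BIG_L * x2^2), minimized at 0.
ELL, BIG_L = 1.0, 100.0

def f(x):
    return 0.5 * (ELL * x[0] ** 2 + BIG_L * x[1] ** 2)

def grad_f(x):
    return [ELL * x[0], BIG_L * x[1]]

x_final = accelerated_gradient(grad_f, [1.0, 1.0], ELL, BIG_L, iters=300)
gap = f(x_final)  # f(x_k) - f(x*), since f(x*) = 0 here
```

With a condition number of 100, the per-iteration contraction of the standard bound is about 1 - 1/sqrt(100) = 0.9, which is why a few hundred iterations suffice in this toy case.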

Sum of squares certificates for stability of planar, homogeneous, and switched systems

Published: 2017/12/29

Amir Ali Ahmadi

Pablo A. Parrilo

Control Applications, Semi-definite Programming, Systems governed by Differential Equations Optimization

lyapunov functions, semidefinite programming, stability of nonlinear systems, sum of squares

We show that the existence of a global polynomial Lyapunov function for a homogeneous polynomial vector field or a planar polynomial vector field (under a mild condition) implies the existence of a polynomial Lyapunov function that is a sum of squares (sos) and such that the negative of its derivative is also a sum of squares. This result … Read more

A Stochastic Trust Region Algorithm Based on Careful Step Normalization

Published: 2017/12/29, Updated: 2018/06/26

Frank E. Curtis

Katya Scheinberg

Rui Shi

Nonlinear Optimization, Stochastic Programming, Unconstrained Optimization

deep neural networks, finite sum minimization, logistic regression, machine learning, stochastic gradient method, stochastic optimization, trust-region methods

An algorithm is proposed for solving stochastic and finite-sum minimization problems. Based on a trust-region methodology, the algorithm employs normalized steps, at least as long as the norms of the stochastic gradient estimates are within a specified interval. The complete algorithm, which dynamically chooses whether or not to employ normalized steps, is proved to have … Read more
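The idea of normalizing a step only while the gradient norm lies in an interval can be sketched as follows. This is an illustration under stated assumptions, not the paper's algorithm: the function name `normalized_step`, the parameters `alpha`, `gamma1`, `gamma2`, the interval [1/gamma1, 1/gamma2], and the scaling applied outside the interval are all hypothetical choices, made here so that the step length varies continuously with the gradient norm.

```python
import math

def normalized_step(x, g, alpha, gamma1, gamma2):
    """One update x - scale * g, normalized only when ||g|| is in [1/gamma1, 1/gamma2].

    Assumes gamma1 > gamma2 > 0 (hypothetical parameters for this sketch).
    """
    norm = math.sqrt(sum(gi * gi for gi in g))
    if norm < 1.0 / gamma1:
        scale = gamma1 * alpha   # small gradient: plain scaled gradient step
    elif norm <= 1.0 / gamma2:
        scale = alpha / norm     # inside the interval: step has length alpha
    else:
        scale = gamma2 * alpha   # large gradient: plain scaled gradient step
    return [xi - scale * gi for xi, gi in zip(x, g)]

# Interval of gradient norms for these values is [0.5, 2.0].
inside = normalized_step([0.0, 0.0], [0.6, 0.8], alpha=0.1, gamma1=2.0, gamma2=0.5)
small = normalized_step([0.0, 0.0], [0.1, 0.0], alpha=0.1, gamma1=2.0, gamma2=0.5)
large = normalized_step([0.0, 0.0], [10.0, 0.0], alpha=0.1, gamma1=2.0, gamma2=0.5)
```

In this sketch the three branches agree at the interval endpoints, so the step length never jumps as the gradient estimate grows or shrinks.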

Production Lot Sizing with Immediately Observable Random Production Rate

Published: 2017/12/26

Ilan Adler

Dan Bu

Philip Kaminsky

Applications - OR and Management Sciences, Production and Logistics

inventory control, production lot-sizing, random production rate

To explore one impact of the information made available by adding sensors in a classical production planning setting, we consider a continuous-time, infinite-horizon lot-sizing model where a single product is manufactured on a single machine. Each time manufacturing restarts, a random production rate is realized, and production continues at this rate until the machine … Read more

Let’s Make Block Coordinate Descent Go Fast: Faster Greedy Rules, Message-Passing, Active-Set Complexity, and Superlinear Convergence

Published: 2017/12/23

Issam Laradji

Mark Schmidt

Julie Nutini

Convex and Nonsmooth Optimization

block-coordinate descent, convex optimization, nonsmooth optimization

Block coordinate descent (BCD) methods are widely used for large-scale numerical optimization because of their cheap iteration costs, low memory requirements, amenability to parallelization, and ability to exploit problem structure. Three main algorithmic choices influence the performance of BCD methods: the block partitioning strategy, the block selection rule, and the block update rule. In this paper … Read more
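The three choices named in this abstract can be made concrete with a minimal sketch on a small quadratic. Everything specific here is an assumption for illustration: the fixed partition `blocks`, the greedy largest-gradient-norm selection, the plain gradient update with a global step size, and the test matrix. It recomputes the full gradient each iteration for simplicity, which a practical BCD implementation would avoid.

```python
def matvec(A, x):
    """Dense matrix-vector product for a list-of-rows matrix."""
    return [sum(a * xi for a, xi in zip(row, x)) for row in A]

def greedy_bcd(A, b, blocks, step, iters):
    """Minimize 0.5*x'Ax - b'x, updating one greedily chosen block per iteration."""
    x = [0.0] * len(b)
    for _ in range(iters):
        g = [ax - bi for ax, bi in zip(matvec(A, x), b)]  # gradient A x - b
        # Block selection rule: pick the block with the largest gradient norm.
        best = max(blocks, key=lambda blk: sum(g[i] ** 2 for i in blk))
        # Block update rule: a gradient step restricted to the chosen block.
        for i in best:
            x[i] -= step * g[i]
    return x

# Hypothetical test problem: a diagonally dominant, symmetric positive definite matrix.
A = [[4.0, 1.0, 0.0, 0.0],
     [1.0, 3.0, 1.0, 0.0],
     [0.0, 1.0, 3.0, 1.0],
     [0.0, 0.0, 1.0, 4.0]]
b = [1.0, 2.0, 3.0, 4.0]
blocks = [(0, 1), (2, 3)]  # block partitioning strategy: two contiguous blocks

# Every row sum of A is at most 5, so 5 bounds the largest eigenvalue.
x = greedy_bcd(A, b, blocks, step=1.0 / 5.0, iters=500)
residual = max(abs(ax - bi) for ax, bi in zip(matvec(A, x), b))
```

Swapping the `max` call for a random or cyclic choice changes only the selection rule, which makes the sketch a convenient place to compare rules on small problems.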

Matrices with lexicographically-ordered rows

Published: 2017/12/22, Updated: 2018/09/06

Gustavo Angulo

(Mixed) Integer Linear Programming

The lexicographic order can be used to force a collection of decision vectors to be all different, i.e., to take on different values in some coordinates. We consider the set of fixed-size matrices with bounded integer entries and rows in lexicographic order. We present a dynamic program to optimize a linear function over this set, … Read more
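To make the set described in this abstract concrete, the following sketch enumerates small matrices by brute force and maximizes a linear function over those whose rows are in lexicographic order. It is not the dynamic program from the paper: the function name `best_lex_matrix`, the increasing direction of the order, the `strict` flag, and the maximization sense are assumptions for illustration, and the enumeration is only practical for tiny instances.

```python
from itertools import combinations, combinations_with_replacement, product

def best_lex_matrix(cost, upper, strict=True):
    """Maximize sum(cost[i][j] * X[i][j]) over integer matrices X with entries
    in 0..upper whose rows are in increasing lexicographic order."""
    m, n = len(cost), len(cost[0])
    # Python compares tuples lexicographically, so sorting orders candidate rows.
    rows = sorted(product(range(upper + 1), repeat=n))
    # Strict order forbids repeated rows; non-strict order allows them.
    choose = combinations if strict else combinations_with_replacement
    best_val, best_X = None, None
    for X in choose(rows, m):  # rows of X appear in lexicographic order
        val = sum(c * v for crow, xrow in zip(cost, X) for c, v in zip(crow, xrow))
        if best_val is None or val > best_val:
            best_val, best_X = val, X
    return best_val, best_X

cost = [[1, 1], [1, 1]]
val_strict, X_strict = best_lex_matrix(cost, upper=1, strict=True)
val_loose, X_loose = best_lex_matrix(cost, upper=1, strict=False)
```

With all costs equal to one and binary entries, the non-strict optimum repeats the row (1, 1), while the strict order forces the two rows to differ and lowers the optimal value by one.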

An Algorithm for Piecewise Linear Optimization of Objective Functions in Abs-normal Form

Published: 2017/12/22

Andreas Griewank

Andrea Walther

Nonsmooth Optimization

abs-normal form, active set and signature, karush kuhn tucker (kkt), linear independence kink qualification (likq), normal growth, quadratic regularization, successive piecewise linear optimization (splop), tangential stationarity

In the paper [11] we derived first-order (KKT) and second-order (SSC) optimality conditions for functions defined by evaluation programs involving smooth elementals and absolute values. For this class of problems we showed in [12] that the natural algorithm of successive piecewise linear optimization with a proximal term (SPLOP) achieves a linear or even … Read more