Robert M. Freund – Optimization Online

The Role of Level-Set Geometry on the Performance of PDHG for Conic Linear Optimization

Published: 2024/06/04, Updated: 2024/07/15

We consider solving huge-scale instances of (convex) conic linear optimization problems, at the scale where matrix-factorization-free methods are attractive or necessary. The restarted primal-dual hybrid gradient method (rPDHG) — with heuristic enhancements and GPU implementation — has been very successful in solving huge-scale linear programming (LP) problems; however its application to more general conic convex … Read more

Computational Guarantees for Restarted PDHG for LP based on “Limiting Error Ratios” and LP Sharpness

Published: 2023/12/27, Updated: 2024/04/29

Zikai Xiong

Robert M. Freund

Convex Optimization, Linear Programming, Linear, Cone and Semidefinite Programming computational complexity, error ratio, first-order methods, Linear Optimization, linear program, PDHG, restarts, Sharpness.

In recent years, there has been growing interest in solving linear optimization problems – or more simply “LP” – using first-order methods in order to avoid the costly matrix factorizations of traditional methods for huge-scale LP instances. The restarted primal-dual hybrid gradient method (PDHG) – together with some heuristic techniques – has emerged as a … Read more

On the Relation Between LP Sharpness and Limiting Error Ratio and Complexity Implications for Restarted PDHG

Published: 2023/12/27, Updated: 2023/12/29

Zikai Xiong

Robert M. Freund

There has been a recent surge in development of first-order methods (FOMs) for solving huge-scale linear programming (LP) problems. The attractiveness of FOMs for LP stems in part from the fact that they avoid costly matrix factorization computation. However, the efficiency of FOMs is significantly influenced – both in theory and in practice – by … Read more

Using Taylor-Approximated Gradients to Improve the Frank-Wolfe Method for Empirical Risk Minimization

Published: 2022/08/29, Updated: 2023/11/22

Zikai Xiong

Robert M. Freund

Constrained Nonlinear Optimization, Convex Optimization, Stochastic Programming computational complexity, convex optimization, empirical risk minimization, frank-wolfe, linear minimization oracle, linear prediction

\(\) The Frank-Wolfe method has become increasingly useful in statistical and machine learning applications, due to the structure-inducing properties of the iterates, and especially in settings where linear minimization over the feasible set is more computationally efficient than projection. In the setting of Empirical Risk Minimization — one of the fundamental optimization problems in statistical … Read more

Analysis of the Frank-Wolfe Method for Convex Composite Optimization involving a Logarithmically-Homogeneous Barrier

Published: 2021/06/15, Updated: 2021/07/03

Robert M. Freund

Renbo Zhao

Convex and Nonsmooth Optimization, Convex Optimization barrier, complexity, composite optimization, frank-wolfe, logarithmic-homogeneity, self-concordance

We present and analyze a new generalized Frank-Wolfe method for the composite optimization problem (P): F*:= min_x f(Ax) + h(x), where f is a \theta-logarithmically-homogeneous self-concordant barrier and the function h has bounded domain but is possibly non-smooth. We show that our generalized Frank-Wolfe method requires O((Gap_0 + \theta + Var_h)\ln(\delta_0) + (\theta + Var_h)^2/\epsilon) … Read more

An Oblivious Ellipsoid Algorithm for Solving a System of (In)Feasible Linear Inequalities

Published: 2019/10/08, Updated: 2020/12/28

Robert M. Freund

Michael J. Todd

Jourdain Lamperski

Convex Optimization, Linear Programming, Nonsmooth Optimization certificates, complexity, condition measures, ellipsoid algorithm, linear inequalities

The ellipsoid algorithm is a fundamental algorithm for computing a solution to the system of m linear inequalities in n variables (P) when its set of solutions has positive volume. However, when (P) is infeasible, the ellipsoid algorithm has no mechanism for proving that (P) is infeasible. This is in contrast to the other two … Read more

Condition Number Analysis of Logistic Regression, and its Implications for Standard First-Order Solution Methods

Published: 2018/10/19

Robert M. Freund

Rahul Mazumder

Paul Grigas

Convex Optimization, Data-Mining, Statistics condition numbers, logistic regression, steepest descent, stochastic gradient descent

Logistic regression is one of the most popular methods in binary classification, wherein estimation of model parameters is carried out by solving the maximum likelihood (ML) optimization problem, and the ML estimator is defined to be the optimal solution of this problem. It is well known that the ML estimator exists when the data is … Read more

Generalized Stochastic Frank-Wolfe Algorithm with Stochastic “Substitute” Gradient for Structured Convex Optimization

Published: 2018/07/29, Updated: 2018/09/04

Robert M. Freund

Haihao Lu

Convex Optimization, Nonlinear Optimization, Stochastic Programming complexity, conditional gradient, frank-wolfe, stochastic gradient

The stochastic Frank-Wolfe method has recently attracted much general interest in the context of optimization for statistical and machine learning due to its ability to work with a more general feasible region. However, there has been a complexity gap in the guaranteed convergence rate for stochastic Frank-Wolfe compared to its deterministic counterpart. In this work, … Read more

Relatively-Smooth Convex Optimization by First-Order Methods, and Applications

Published: 2016/10/19, Updated: 2017/10/10

Robert M. Freund

Haihao Lu

Yurii Nesterov

Convex and Nonsmooth Optimization, Convex Optimization complexity, d-optimal design, dual averaging, first-order methods, large-scale optimization, primal gradient

The usual approach to developing and analyzing first-order methods for smooth convex optimization assumes that the gradient of the objective function is uniformly smooth with some Lipschitz constant L. However, in many settings the differentiable convex function f(.) is not uniformly smooth — for example in D-optimal design where f(x):=-ln det(HXH^T), or even the univariate … Read more

An Extended Frank-Wolfe Method with “In-Face” Directions, and its Application to Low-Rank Matrix Completion

Published: 2015/11/06

Robert M. Freund

Rahul Mazumder

Paul Grigas

Convex Optimization, Data-Mining conditional gradient, first-order methods, frank-wolfe, low rank, matrix completion, nuclear norm

We present an extension of the Frank-Wolfe method that is designed to induce near-optimal solutions on low-dimensional faces of the feasible region. We present computational guarantees for the method that trade off efficiency in computing near-optimal solutions with upper bounds on the dimension of minimal faces of iterates. We apply our method to the low-rank … Read more