Statistics – Page 4 – Optimization Online

Dual optimal design and the Christoffel-Darboux polynomial

Published: 2020/09/07

Convex Optimization, Semi-definite Programming, Statistics convex analysis, data science, semidefinite programming, statistics

The purpose of this short note is to show that the Christoffel-Darboux polynomial, useful in approximation theory and data science, arises naturally when deriving the dual to the problem of semi-algebraic D-optimal experimental design in statistics. It uses only elementary notions of convex analysis. ArticleDownload View PDF

Exact Penalty Function for L21 Norm Minimization over the Stiefel Manifold

Published: 2020/07/19, Updated: 2022/08/01

Nachuan Xiao

Xin Liu

Ya-xiang Yuan

Constrained Nonlinear Optimization, Statistics augmented lagrangian method, nonsmooth optimization, orthogonality constraint, stiefel manifold

L21 norm minimization with orthogonality constraints, feasible region of which is called Stiefel manifold, has wide applications in statistics and data science. The state-of-the-art approaches adopt proximal gradient technique on either Stiefel manifold or its tangent spaces. The consequent subproblem does not have closed-form solution and hence requires an iterative procedure to solve which is … Read more

The block mutual coherence property condition for signal recovery

Published: 2020/07/10

Data-Mining, Nonsmooth Optimization, Statistics

Compressed sensing shows that a sparse signal can stably be recovered from incomplete linear measurements. But, in practical applications, some signals have additional structure, where the nonzero elements arise in some blocks. We call such signals as block-sparse signals. In this paper, the $\ell_2/\ell_1-\alpha\ell_2$ minimization method for the stable recovery of block-sparse signals is investigated. … Read more

An Analysis of Constant Step Size SGD in the Non-convex Regime: Asymptotic Normality and Bias

Published: 2020/06/14, Updated: 2020/06/15

Krishnakumar Balasubramanian

Stanislav Volgushev

Murat A. Erdogdu

Lu Yu

Statistics, Stochastic Approaches, Stochastic Programming asymptotic normality, constant step-size sgd, stochastic optimization

Structured non-convex learning problems, for which critical points have favorable statistical properties, arise frequently in statistical machine learning. Algorithmic convergence and statistical estimation rates are well-understood for such problems. However, quantifying the uncertainty associated with the underlying training algorithm is not well-studied in the non-convex setting. In order to address this short-coming, in this work, … Read more

Linear Programming and Community Detection

Published: 2020/06/04, Updated: 2022/05/11

Alberto Del Pia

Aida Khajavirad

Dmitriy Kunisky

0-1 Programming, Polyhedra, Statistics community detection, linear programming, metric polytope, minimum bisection problem

The problem of community detection with two equal-sized communities is closely related to the minimum graph bisection problem over certain random graph models. In the stochastic block model distribution over networks with community structure, a well-known semidefinite programming (SDP) relaxation of the minimum bisection problem recovers the underlying communities whenever possible. Motivated by their superior … Read more

Solving Large-Scale Sparse PCA to Certifiable (Near) Optimality

Published: 2020/05/11, Updated: 2022/09/27

Dimitris Bertsimas

Ryan Cory-Wright

Jean Pauphilet

(Mixed) Integer Nonlinear Programming, Semi-definite Programming, Statistics mixed-integer programming, semidefinite programming, sparse eigenvalues, sparse principal component analysis

Sparse principal component analysis (PCA) is a popular dimensionality reduction technique for obtaining principal components which are linear combinations of a small subset of the original features. Existing approaches cannot supply certifiably optimal principal components with more than $p=100s$ of variables. By reformulating sparse PCA as a convex mixed-integer semidefinite optimization problem, we design a … Read more

The high-order block RIP for non-convex block-sparse compressed sensing

Published: 2020/03/07

Data-Mining, Statistics

This paper concentrates on the recovery of block-sparse signals, which is not only sparse but also nonzero elements are arrayed into some blocks (clusters) rather than being arbitrary distributed all over the vector, from linear measurements. We establish high-order sufficient conditions based on block RIP to ensure the exact recovery of every block $s$-sparse signal … Read more

Safe screening rules for L0-Regression

Published: 2020/02/18

Alper Atamturk

Andrés Gómez

(Mixed) Integer Nonlinear Programming, Statistics perspective reformulation, presolve, screening rules, sparse regression

We give safe screening rules to eliminate variables from regression with L0 regularization or cardinality constraint. These rules are based on guarantees that a feature may or may not be selected in an optimal solution. The screening rules can be computed from a convex relaxation solution in linear time, without solving the L0 optimization problem. … Read more

Learning Optimal Classification Trees: Strong Max-Flow Formulations

Published: 2020/02/17, Updated: 2020/04/27

Sina Aghaei

Andrés Gómez

Phebe Vayanos

(Mixed) Integer Linear Programming, Combinatorial Optimization, Statistics benders decomposition, machine learning, mixed-integer programming, optimal decision-trees

We consider the problem of learning optimal binary classification trees. Literature on the topic has burgeoned in recent years, motivated both by the empirical suboptimality of heuristic approaches and the tremendous improvements in mixed-integer programming (MIP) technology. Yet, existing approaches from the literature do not leverage the power of MIP to its full extent. Indeed, … Read more

Outlier detection in time series via mixed-integer conic quadratic optimization

Published: 2019/11/21, Updated: 2021/04/28

Andrés Gómez

(Mixed) Integer Nonlinear Programming, Statistics conic quadratic optimization, convexification, lifting, mixed-integer programming, outlier detection, trimmed least squares

We consider the problem of estimating the true values of a Wiener process given noisy observations corrupted by outliers. The problem considered is closely related to the Trimmed Least Squares estimation problem, a robust estimation procedure well-studied from a statistical standpoint but poorly understood from an optimization perspective. In this paper we show how to … Read more