Data-Mining – Optimization Online

A mathematical introduction to SVMs with self-concordant kernel

Published: 2024/06/05, Updated: 2024/06/06

Data Science Algorithms, Data-Mining, Quadratic Programming continuity, kernel, Support Vector Machine

A derivation of so-called “soft-margin Support Vector Machines with kernel” is presented which does not rely on concepts from functional analysis such as Mercer’s theorem that is frequently cited in this context, and that leads to a new analysis of the continuity properties of the kernel functions such as a new self-concordance condition for the … Read more

Optimization Over Trained Neural Networks: Taking a Relaxing Walk

Published: 2024/01/29

Jiatai Tong

Junyang Cai

Thiago Serra

(Mixed) Integer Linear Programming, Data-Mining, Meta Heuristics

Besides training, mathematical optimization is also used in deep learning to model and solve formulations over trained neural networks for purposes such as verification, compression, and optimization with learned constraints. However, solving these formulations soon becomes difficult as the network size grows due to the weak linear relaxation and dense constraint matrix. We have seen … Read more

Optimal Low-Rank Matrix Completion: Semidefinite Relaxations and Eigenvector Disjunctions

Published: 2023/05/15, Updated: 2024/01/26

Data-Mining, Global Optimization Theory, Semi-definite Programming

Low-rank matrix completion consists of computing a matrix of minimal complexity that recovers a given set of observations as accurately as possible. Unfortunately, existing methods for matrix completion are heuristics that, while highly scalable and often identifying high-quality solutions, do not possess any optimality guarantees. We reexamine matrix completion with an optimality-oriented eye. We reformulate … Read more

Bilevel optimization with a multi-objective lower-level problem: Risk-neutral and risk-averse formulations

Published: 2023/02/10, Updated: 2023/02/12

Tommaso Giovannelli

Griffin D. Kent

Luis Nunes Vicente

Data-Mining, Multi-Criteria Optimization, Stochastic Programming bilevel optimization, multi-objective optimization, risk averse, risk-neutral

In this work, we propose different formulations and gradient-based algorithms for deterministic and stochastic bilevel problems with conflicting objectives in the lower level. Such problems have received little attention in the deterministic case and have never been studied from a stochastic approximation viewpoint despite the recent advances in stochastic methods for single-level, bilevel, and multi-objective … Read more

Sparse PCA With Multiple Components

Published: 2022/09/29, Updated: 2023/10/31

Ryan Cory-Wright

Jean Pauphilet

Data Science Theory, Data-Mining, Semi-definite Programming

Sparse Principal Component Analysis (sPCA) is a cardinal technique for obtaining combinations of features, or principal components (PCs), that explain the variance of high-dimensional datasets in an interpretable manner. This involves solving a sparsity and orthogonality-constrained convex maximization problem, which is extremely computationally challenging. Most existing works address sparse PCA via methods—such as iteratively computing … Read more

A new sufficient condition for non-convex sparse recovery via weighted $\ell_r\!-\!\ell_1$ minimization

Published: 2022/04/16

Jianwen Huang

Feng Zhang

Data-Mining, Nonsmooth Optimization compressed sensing, Constrained weighted $\ell_r-\ell_1$ minimization, Nonconvex sparse recovery, restricted isometry property

In this letter, we discuss the reconstruction of sparse signals from undersampled data, which belongs to the core content of compressed sensing. A new sufficient condition in terms of the restricted isometry constant (RIC) and restricted orthogonality constant (ROC) is first established for the performance guarantee of recently proposed non-convex weighted $\ell_r-\ell_1$ minimization in recovering … Read more

The Null Space Property of the Weighted $\ell_r-\ell_1$ Minimization

Published: 2022/04/02

Jianwen Huang

Xinling Liu

Data-Mining, Nonsmooth Optimization, Robust Optimization compressed sensing, null space property, sparse signal recovery, the weighted $\ell_r-\ell_1$ minimization

The null space property (NSP), which relies merely on the null space of the sensing matrix column space, has drawn numerous interests in sparse signal recovery. This article studies NSP of the weighted $\ell_r-\ell_1$ minimization. Several versions of NSP of the weighted $\ell_r-\ell_1$ minimization including the weighted $\ell_r-\ell_1$ NSP, the weighted $\ell_r-\ell_1$ stable NSP, the … Read more

The Combinatorial Brain Surgeon: Pruning Weights That Cancel One Another in Neural Networks

Published: 2022/03/10

Data-Mining, Meta Heuristics, Quadratic Programming loss function, network pruning, neural networks

Neural networks tend to achieve better accuracy with training if they are larger — even if the resulting models are overparameterized. Nevertheless, carefully removing such excess parameters before, during, or after training may also produce models with similar or even improved accuracy. In many cases, that can be curiously achieved by heuristics as simple as … Read more

Mixed-Integer Programming Techniques for the Minimum Sum-of-Squares Clustering Problem

Published: 2022/03/09, Updated: 2022/11/29

(Mixed) Integer Nonlinear Programming, Cutting Plane Approaches, Data-Mining Computational Techniques, global optimization, Minimum Sum-of-Squares Clustering, Mixed-integer nonlinear optimization

The minimum sum-of-squares clustering problem is a very important problem in data mining and machine learning with very many applications in, e.g., medicine or social sciences. However, it is known to be NP-hard in all relevant cases and to be notoriously hard to be solved to global optimality in practice. In this paper, we develop … Read more

Kernel Probabilistic Distance Clustering

Published: 2022/02/07

Dilay Ozkan

Cem Iyigun

Constrained Nonlinear Optimization, Data-Mining clustering, kernel functions, probabilistic distance clustering

Citation Department of Industrial Engineering, Middle East Technical University, Ankara, Turkey Article Download View Kernel Probabilistic Distance Clustering