A mathematical introduction to SVMs with self-concordant kernel

A derivation of so-called “soft-margin support vector machines with kernel” is presented, along with elementary proofs that do not rely on concepts from functional analysis, such as Mercer’s theorem or reproducing kernel Hilbert spaces, which are frequently cited in this context. The analysis leads to new continuity properties of the kernel functions, in particular a …
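For reference, the standard soft-margin kernel SVM that such derivations arrive at can be stated as the following dual problem over training points (x_i, y_i), i = 1, …, n (this is the textbook formulation, not the paper’s specific proof route):

```latex
% Soft-margin SVM dual with kernel k and regularization parameter C
\max_{\alpha \in \mathbb{R}^n} \quad \sum_{i=1}^{n} \alpha_i
  - \frac{1}{2} \sum_{i=1}^{n} \sum_{j=1}^{n} \alpha_i \alpha_j \, y_i y_j \, k(x_i, x_j)
\quad \text{s.t.} \quad 0 \le \alpha_i \le C, \qquad \sum_{i=1}^{n} \alpha_i y_i = 0.
```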

Statistical and Computational Guarantees of Kernel Max-Sliced Wasserstein Distances

Optimal transport has been very successful for various machine learning tasks; however, it is known to suffer from the curse of dimensionality. Hence, dimensionality reduction is desirable when optimal transport is applied to high-dimensional data with low-dimensional structure. The kernel max-sliced (KMS) Wasserstein distance is developed for this purpose by finding an optimal nonlinear mapping that reduces data into …
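As a rough illustration of the “max-sliced” idea in the linear case (the kernel variant instead optimizes over nonlinear maps in an RKHS), here is a minimal sketch; the crude random search over directions and all function names are assumptions for illustration, not the paper’s algorithm:

```python
import numpy as np

def sliced_w2_sq(X, Y, theta):
    """Squared 1-D Wasserstein-2 between the projections X@theta and Y@theta.
    Assumes X and Y have the same number of rows (equal uniform weights),
    so the optimal 1-D coupling is the sorted (monotone) matching."""
    px, py = np.sort(X @ theta), np.sort(Y @ theta)
    return np.mean((px - py) ** 2)

def max_sliced_w2(X, Y, n_dirs=2000, seed=0):
    """Crude random-search lower bound on the (linear) max-sliced W2 distance."""
    rng = np.random.default_rng(seed)
    best = 0.0
    for _ in range(n_dirs):
        theta = rng.standard_normal(X.shape[1])
        theta /= np.linalg.norm(theta)       # restrict to unit directions
        best = max(best, sliced_w2_sq(X, Y, theta))
    return np.sqrt(best)
```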

A graph-structured distance for mixed-variable domains with meta variables

Heterogeneous datasets emerge in various machine learning and optimization applications that feature different input sources, types, or formats. Most models or methods do not natively handle heterogeneity. Hence, such datasets are often partitioned into smaller, simpler ones, which may limit generalizability or performance, especially if data is limited. The first main contribution of …

Mixed-Integer Linear Optimization for Cardinality-Constrained Random Forests

Random forests are among the most popular algorithms for solving classification problems, in particular for large-scale data sets. Given a set of labeled points and several decision trees, the method classifies a new point by taking the majority vote of the trees; a minimal sketch of this baseline voting rule appears below. In some scenarios, however, labels are only accessible for a proper subset of the given …
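The voting rule itself is simple (illustrative only, before any cardinality constraints are imposed):

```python
from collections import Counter

def majority_vote(tree_predictions):
    """Classify a point by the majority vote of an ensemble's per-tree labels."""
    return Counter(tree_predictions).most_common(1)[0][0]

# e.g. five trees voting on one point:
print(majority_vote(["cat", "dog", "cat", "cat", "dog"]))  # -> "cat"
```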

A Proximal-Gradient Method for Constrained Optimization

We present a new algorithm for solving optimization problems whose objective function is the sum of a smooth function and a (potentially) nonsmooth regularization function, subject to nonlinear equality constraints. The algorithm may be viewed as an extension of the well-known proximal-gradient method, which is applicable when constraints are not present. To account for nonlinear …
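For context, the unconstrained baseline being extended is the classical proximal-gradient iteration x⁺ = prox_{αr}(x − α∇f(x)); a minimal sketch for the ℓ1-regularized case (this is the standard method, not the paper’s constrained extension):

```python
import numpy as np

def soft_threshold(v, tau):
    """Proximal operator of tau * ||.||_1 (soft-thresholding)."""
    return np.sign(v) * np.maximum(np.abs(v) - tau, 0.0)

def proximal_gradient(grad_f, x0, alpha, lam, n_iters=200):
    """Classical proximal-gradient method for min_x f(x) + lam*||x||_1,
    with a fixed step size alpha; no constraints are handled here."""
    x = np.array(x0, dtype=float)
    for _ in range(n_iters):
        x = soft_threshold(x - alpha * grad_f(x), alpha * lam)
    return x
```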

Learning-to-Optimize with PAC-Bayesian Guarantees: Theoretical Considerations and Practical Implementation

We apply PAC-Bayesian theory to the setting of learning-to-optimize. To the best of our knowledge, we present the first framework to learn optimization algorithms with provable generalization guarantees (PAC-Bayesian bounds) and an explicit trade-off between convergence guarantees and convergence speed, which contrasts with the typical worst-case analysis. Our learned optimization algorithms provably outperform related ones …

Stochastic Aspects of Dynamical Low-Rank Approximation in the Context of Machine Learning

The central challenges of today’s neural network architectures are the prohibitive memory footprint and training costs associated with determining optimal weights and biases. A large portion of research in machine learning is therefore dedicated to constructing memory-efficient training methods. One promising approach is dynamical low-rank training (DLRT), which represents and trains parameters as a low-rank …
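To see where the memory savings come from, a small sketch of the low-rank parameterization W ≈ U S Vᵀ that DLRT-style methods train (a parameter-count illustration only, not the DLRT integrator itself):

```python
import numpy as np

m, n, r = 512, 512, 16                      # layer dimensions and chosen rank
U = np.linalg.qr(np.random.randn(m, r))[0]  # orthonormal left factor  (m x r)
V = np.linalg.qr(np.random.randn(n, r))[0]  # orthonormal right factor (n x r)
S = np.random.randn(r, r)                   # small trainable core     (r x r)
W = U @ S @ V.T                             # rank-r weight matrix

print(m * n)            # 262144 parameters for a dense layer
print(r * (m + n + r))  # 16640 parameters for the low-rank factors
```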

Data Collaboration Analysis with Orthonormal Basis Selection and Alignment

Data Collaboration (DC) enables multiple parties to jointly train a model by sharing only linear projections of their private datasets. The core challenge in DC is to align the bases of these projections without revealing each party’s secret basis. While existing theory suggests that any target basis spanning the common subspace should suffice, in practice, …
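A schematic of the sharing step as described; the shared “anchor” dataset used for alignment is an assumption borrowed from common DC-style protocols, and all names here are illustrative:

```python
import numpy as np

rng = np.random.default_rng(0)
d, k = 20, 5
X_private = rng.standard_normal((100, d))  # one party's raw data (never shared)
X_anchor  = rng.standard_normal((30, d))   # public anchor data known to all parties
F = rng.standard_normal((d, k))            # this party's secret projection basis

# Only the projections leave the party; alignment maps between parties are
# later fit on the anchor projections, so F itself is never revealed.
shared_data   = X_private @ F
shared_anchor = X_anchor @ F
```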

Robust support vector machines via conic optimization

We consider the problem of learning support vector machines that are robust to uncertainty. It has been established in the literature that typical loss functions, including the hinge loss, are sensitive to data perturbations and outliers, and thus perform poorly in the setting considered. In contrast, using the 0-1 loss or a suitable non-convex approximation results in robust …
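The two losses being contrasted, written in terms of the margin z = y f(x): the convex but outlier-sensitive hinge loss versus the robust but non-convex 0-1 loss:

```latex
\ell_{\mathrm{hinge}}(z) = \max\{0,\, 1 - z\},
\qquad
\ell_{0\text{-}1}(z) = \mathbb{1}[\, z \le 0 \,],
\qquad z = y\, f(x).
```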

Data-Driven Reliable Facility Location Design

We study the reliable (uncapacitated) facility location (RFL) problem in a data-driven environment where historical observations of random demands and disruptions are available. Owing to the combinatorial nature of the RFL problem and the mixed-binary randomness of its parameters, state-of-the-art RFL models applied to the data-driven setting either suggest overly conservative solutions, or …