Optimization in Data Science – Page 3

An Extended Validity Domain for Constraint Learning

Published: 2024/06/19

Data Science Algorithms, Optimization in Data Science Constraint learning, convex hull, machine learning, mixed-integer programming, validity domain

We consider embedding a predictive machine-learning model within a prescriptive optimization problem. In this setting, called constraint learning, we study the concept of a validity domain, i.e., a constraint added to the feasible set, which keeps the optimization close to the training data, thus helping to ensure that the computed optimal solution exhibits less prediction … Read more

A mathematical introduction to SVMs with self-concordant kernel

Published: 2024/06/05, Updated: 2024/12/12

Florian Jarre

Data Science Algorithms, Data-Mining, Quadratic Programming continuity, kernel, Support Vector Machine

A derivation of so-called “soft-margin support vector machines with kernel” is presented along with elementary proofs that do not rely on concepts from functional analysis such as Mercer’s theorem or reproducing kernel Hilbert spaces which are frequently cited in this context. The analysis leads to new continuity properties of the kernel functions, in particular a … Read more

Statistical and Computational Guarantees of Kernel Max-Sliced Wasserstein Distances

Published: 2024/05/30

Jie Wang

Optimization in Data Science Finite-Sample Performance Guarantee, Kernel Max-Sliced Wasserstein Distance, quality of semidefinite relaxations

Optimal transport has been very successful for various machine learning tasks; however, it is known to suffer from the curse of dimensionality. Hence, dimensionality reduction is desirable when applied to high-dimensional data with low-dimensional structures. The kernel max-sliced (KMS) Wasserstein distance is developed for this purpose by finding an optimal nonlinear mapping that reduces data … Read more

A graph-structured distance for mixed-variable domains with meta variables

Published: 2024/05/20, Updated: 2024/08/22

Data Science Theory, Optimization in Data Science distances, heterogeneous datasets, machine learning, meta variables

Heterogeneous datasets emerge in various machine learning and optimization applications that feature different input sources, types or formats. Most models or methods do not natively tackle heterogeneity. Hence, such datasets are often partitioned into smaller and simpler ones, which may limit the generalizability or performance, especially if data is limited. The first main contribution of … Read more

Mixed-Integer Linear Optimization for Cardinality-Constrained Random Forests

Published: 2024/05/16

Jan Pablo Burgard

Maria Eduarda Pinheiro

Martin Schmidt

(Mixed) Integer Linear Programming, Optimization in Data Science cardinality constraints, Mixed-Integer Linear Optimization, preprocessing, Random forests, Semi-Supervised Learning

Random forests are among the most famous algorithms for solving classification problems, in particular for large-scale data sets. Considering a set of labeled points and several decision trees, the method takes the majority vote to classify a new given point. In some scenarios, however, labels are only accessible for a proper subset of the given … Read more

A Proximal-Gradient Method for Constrained Optimization

Published: 2024/04/10

Daniel P. Robinson

Yutong Dai

Xiaoyi Qu

Nonlinear Optimization, Optimization in Data Science nonconvex optimization, nonlinear optimization, regularization methods, sequential quadratic optimization, sequential quadratic programming, worst-case iteration-complexity

We present a new algorithm for solving optimization problems with objective functions that are the sum of a smooth function and a (potentially) nonsmooth regularization function, and nonlinear equality constraints. The algorithm may be viewed as an extension of the well-known proximal-gradient method that is applicable when constraints are not present. To account for nonlinear … Read more

Learning-to-Optimize with PAC-Bayesian Guarantees: Theoretical Considerations and Practical Implementation

Published: 2024/04/04

Michael Sucker

Jalal Fadili

Peter Ochs

Optimization in Data Science, Other Topics learning-to-optimize, pac-bayes

We use the PAC-Bayesian theory for the setting of learning-to-optimize. To the best of our knowledge, we present the first framework to learn optimization algorithms with provable generalization guarantees (PAC-Bayesian bounds) and explicit trade-off between convergence guarantees and convergence speed, which contrasts with the typical worst-case analysis. Our learned optimization algorithms provably outperform related ones … Read more

Stochastic Aspects of Dynamical Low-Rank Approximation in the Context of Machine Learning

Published: 2024/03/23, Updated: 2024/05/15

Data Science Theory, Nonlinear Optimization, Optimization in Data Science deep neural networks, Dynamical Low-Rank Approximation (DLRA), Dynamical Low-Rank Training13 (DLRT), machine learning, stochastic gradient descent

The central challenges of today’s neural network architectures are the prohibitive memory footprint and the training costs associated with determining optimal weights and biases. A large portion of research in machine learning is therefore dedicated to constructing memory-efficient training methods. One promising approach is dynamical low-rank training (DLRT) which represents and trains parameters as a … Read more

Data Collaboration Analysis Over Matrix Manifolds

Published: 2024/03/05

Keiyu Nosaka

Akiko Yoshise

Linear, Cone and Semidefinite Programming, Optimization in Data Science, Other Topics

The effectiveness of machine learning (ML) algorithms is deeply intertwined with the quality and diversity of their training datasets. Improved datasets, marked by superior quality, enhance the predictive accuracy and broaden the applicability of models across varied scenarios. Researchers often integrate data from multiple sources to mitigate biases and limitations of single-source datasets. However, this … Read more

Robust support vector machines via conic optimization

Published: 2024/02/02

Shaoning Han

Andrés Gómez

(Mixed) Integer Nonlinear Programming, Cone Programming, Optimization in Data Science convexification, indicator variables, Mixed-integer nonlinear optimization, robustness, Support Vector Machine

We consider the problem of learning support vector machines robust to uncertainty. It has been established in the literature that typical loss functions, including the hinge loss, are sensible to data perturbations and outliers, thus performing poorly in the setting considered. In contrast, using the 0-1 loss or a suitable non-convex approximation results in robust … Read more