Optimization in Data Science – Page 2

Linear Model Extraction via Factual and Counterfactual Queries

Published: 2026/02/10, Updated: 2026/03/03

Data Science Algorithms, Data Science Theory, Optimization in Data Science Counterfactuals, Model Extraction, robust optimization

In model extraction attacks, the goal is to reveal the parameters of a black-box machine learning model by querying the model for a selected set of data points. Due to an increasing demand for explanations, this may involve counterfactual queries besides the typically considered factual queries. In this work, we consider linear models and three … Read more

Contextual Distributionally Robust Optimization with Causal and Continuous Structure: An Interpretable and Tractable Approach

Published: 2026/01/15

Jie Wang

Optimization in Data Science, Robust Optimization, Stochastic Programming data-driven stochastic programming, distributionally robust optimization (DRO)

In this paper, we introduce a framework for contextual distributionally robust optimization (DRO) that considers the causal and continuous structure of the underlying distribution by developing interpretable and tractable decision rules that prescribe decisions using covariates. We first introduce the causal Sinkhorn discrepancy (CSD), an entropy-regularized causal Wasserstein distance that encourages continuous transport plans while … Read more

A Majorization-Minimization approach for multiclass classification in a big data scenario

Published: 2026/01/08, Updated: 2026/01/09

Giorgia Franchini

Federica Porta

Émilie Chouzenoux

Jean-Christophe Pesquet

Filippo Camellini

Convex Optimization, Data Science Algorithms, Unconstrained Optimization Foundation Models, Incremental Minimization, machine learning, majorization-minimization method, Multiclass Classification, Support Vector Machine

This work presents a novel optimization approach for training linear classifiers in multiclass classification tasks, when focusing on a regularized and smooth Weston-Watkins support vector machine (SVM) model. We propose a Majorization-Minimization (MM) algorithm to solve the resulting, Lipschitz-differentiable, optimization problem. To enhance scalability of the algorithm when tackling large datasets, we introduce an incremental … Read more

A Geometric Perspective on Polynomially Solvable Convex Maximization

Published: 2026/01/05, Updated: 2026/04/21

Shaoning Han

Yongchun Li

Global Optimization, Integer Programming, Optimization in Data Science integer programming

Convex maximization encompasses a broad class of optimization problems and is generally NP-hard, even for low-rank objectives. This paper investigates structural conditions under which convex maximization becomes polynomially solvable. From a geometric perspective, we introduce comonotonicity, a structural property of the feasible region crucial for problem tractability, and establish mathematical characterizations of this property. Under comonotonicity and … Read more

An Inexact Modified Quasi-Newton Method for Nonsmooth Regularized Optimization

Published: 2026/01/05

Nathan Allaire

Sébastien Le Digabel

Dominique Orban

Data Science Algorithms, Nonlinear Optimization, Nonsmooth Optimization composite optimization, inexact evaluations, inexact proximal operator, modified quasi-Newton method, nonconvex optimization, nonsmooth optimization, proximal gradient method, proximal quasi-newton method, regularized optimization

We introduce method iR2N, a modified proximal quasi-Newton method for minimizing the sum of a $C^1$ function $f$ and a lower semi-continuous prox-bounded $h$ that permits inexact evaluations of $f$, $\nabla f$ and of the relevant proximal operators. Both $f$ and $h$ may be nonconvex. In applications where the proximal operator of $h$ is not … Read more

The Convexity Zoo: A Taxonomy of Function Classes in Optimization

Published: 2026/01/01, Updated: 2026/05/10

Abbas Khademi

Generalized Convexity/Monoticity, Nonlinear Optimization, Optimization in Data Science generalized convexity, nonconvex optimization, quasar-convexity, star-convexity, structured nonconvexity

The tractability of optimization problems depends critically on structural properties of the objective function. Convexity guarantees global optimality of local solutions and enables polynomial-time algorithms under mild assumptions, but many problems arising in modern applications—particularly in machine learning—are inherently nonconvex. Remarkably, a large class of such problems remains amenable to efficient optimization due to additional … Read more

Primal-dual resampling for solution validation in convex stochastic programming

Published: 2025/12/26, Updated: 2026/04/01

Yi Chu

Susan R. Hunter

Raghu Pasupathy

Convex Optimization, Optimization in Data Science, Stochastic Programming statistical inference, stochastic optimization, stochastic programming

Suppose we wish to determine the quality of a candidate solution to a convex stochastic program in which the objective function is a statistical functional parameterized by the decision variable and known deterministic constraints may be present. Inspired by stopping criteria in primal-dual and interior-point methods, we develop cancellation theorems that characterize the convergence of … Read more

Fast and Simple Multiclass Data Segmentation: An Eigendecomposition and Projection-Free Approach

Published: 2025/12/18

Data Science Algorithms, Data Science Applications, Nonlinear Optimization

Graph-based machine learning has seen an increased interest over the last decade with many connections to other fields of applied mathematics. Learning based on partial differential equations, such as the phase-field Allen-Cahn equation, allows efficient handling of semi-supervised learning approaches on graphs. The numerical solution of the graph Allen-Cahn equation via a convexity splitting or … Read more

Iterative Sampling Methods for Sinkhorn Distributionally Robust Optimization

Published: 2025/12/13

Jie Wang

Infinite Dimensional Optimization, Optimization in Data Science, Robust Optimization

Distributionally robust optimization (DRO) has emerged as a powerful paradigm for reliable decision-making under uncertainty. This paper focuses on DRO with ambiguity sets defined via the Sinkhorn discrepancy: an entropy-regularized Wasserstein distance, referred to as Sinkhorn DRO. Existing work primarily addresses Sinkhorn DRO from a dual perspective, leveraging its formulation as a conditional stochastic optimization … Read more

An Elementary Proof of the Near Optimality of LogSumExp Smoothing

Published: 2025/12/11, Updated: 2026/01/19

Thabo Samakhoana

Benjamin Grimmer

Convex Optimization, Data Science Algorithms, Nonsmooth Optimization Elementary, LogSumExp, Lower bound, nonsmooth optimization, smoothing

We consider the design of smoothings of the (coordinate-wise) max function in $\mathbb{R}^d$ in the infinity norm. The LogSumExp function $f(x)=\ln(\sum^d_i\exp(x_i))$ provides a classical smoothing, differing from the max function in value by at most $\ln(d)$. We provide an elementary construction of a lower bound, establishing that every overestimating smoothing of the max function must … Read more