Data Science Algorithms – Optimization Online

An extrapolated and provably convergent algorithm for nonlinear matrix decomposition with the ReLU function

Published: 2025/03/31

Data Science Algorithms, Optimization in Data Science, Other Topics block coordinate minimization, low rank matrix, nonlinear matrix decomposition

Nonlinear matrix decomposition (NMD) with the ReLU function, denoted ReLU-NMD, is the following problem: given a sparse, nonnegative matrix \(X\) and a factorization rank \(r\), identify a rank-\(r\) matrix \(\Theta\) such that \(X\approx \max(0,\Theta)\). This decomposition finds application in data compression, matrix completion with entries missing not at random, and manifold learning. The standard ReLU-NMD … Read more
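The problem statement above can be made concrete with a tiny example. The following is a minimal sketch in plain Python, not the extrapolated algorithm the paper proposes: the vectors `u` and `v` and the helper names are hypothetical, chosen only to show that a rank-1 matrix \(\Theta\) can reproduce, through \(\max(0,\Theta)\), a sparse nonnegative matrix \(X\) whose own rank is higher.

```python
def relu(matrix):
    """Apply max(0, .) entrywise to a list-of-lists matrix."""
    return [[max(0.0, entry) for entry in row] for row in matrix]


def outer(u, v):
    """Rank-1 matrix u v^T as a list of rows."""
    return [[ui * vj for vj in v] for ui in u]


def squared_error(X, Theta):
    """Squared Frobenius norm of X - max(0, Theta)."""
    R = relu(Theta)
    return sum((x - r) ** 2
               for x_row, r_row in zip(X, R)
               for x, r in zip(x_row, r_row))


# Hypothetical rank-1 factor (values chosen for illustration only).
u = [1.0, -1.0, 2.0]
v = [1.0, -2.0, 1.0]
Theta = outer(u, v)   # rank 1, with negative entries
X = relu(Theta)       # sparse and nonnegative

# A nonzero 2x2 minor shows that X itself has rank at least 2.
minor = X[0][0] * X[1][1] - X[0][1] * X[1][0]

print(X)
print(squared_error(X, Theta), minor)
```

Here the linear rank of `X` is at least 2, yet a rank-1 `Theta` fits it with zero error, which is the compression effect that motivates ReLU-NMD.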

A Rank-One-Update Method for the Training of Support Vector Machines

Published: 2025/03/13

Florian Jarre

Bound-constrained Optimization, Convex Optimization, Data Science Algorithms active set method, Rank-one-update, Support Vector Machine

This paper considers convex quadratic programs associated with the training of support vector machines (SVM). Exploiting the special structure of the SVM problem, a new type of active set method with long cycles and stable rank-one-updates is proposed and tested (CMU: cycling method with updates). The structure of the problem allows for a repeated simple … Read more
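For orientation, one standard convex quadratic program that arises in SVM training is the soft-margin dual. The excerpt does not say which formulation the paper works with, so this is only a reference point; the notation (training points \(x_i\), labels \(y_i\in\{-1,1\}\), kernel \(k\), penalty \(C>0\)) is assumed here.

```latex
\[
\min_{\alpha\in\mathbb{R}^n}\;
\frac{1}{2}\sum_{i=1}^{n}\sum_{j=1}^{n}\alpha_i\alpha_j\,y_i y_j\,k(x_i,x_j)
-\sum_{i=1}^{n}\alpha_i
\quad\text{s.t.}\quad
\sum_{i=1}^{n} y_i\alpha_i = 0,\qquad 0\le\alpha_i\le C,\; i=1,\dots,n.
\]
```

The bound constraints \(0\le\alpha_i\le C\) are what make active set methods a natural fit: at a solution, many \(\alpha_i\) typically sit at one of their bounds.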

Spherical Support Vector Machine for Interval-Valued Data

Published: 2025/02/24, Updated: 2025/02/27

Rui Malha

Paula Alexandra Amaral

Data Science Algorithms, Nonlinear Optimization, Quadratic Programming Classification, Interval-valued data, SVM

In this work we propose a generalization of the Spherical Support Vector Machine method, in which the separator is a sphere, applied to Interval-valued data. This type of data belongs to a more general class, known as Symbolic Data, for which features are described by sets, intervals or histograms instead of classic arrays. This paradigm … Read more

prunAdag: an adaptive pruning-aware gradient method

Published: 2025/02/12

Margherita Porcelli

Giovanni Seraghiti

Philippe L. Toint

Data Science Algorithms, Nonlinear Optimization adaptive first-order methods, Complexity theory, Model pruning, objective-function-free optimization (OFFO)

A pruning-aware adaptive gradient method is proposed which classifies the variables in two sets before updating them using different strategies. This technique extends the “relevant/irrelevant” approach of Ding (2019) and Zimmer et al. (2022) and allows a posteriori sparsification of the solution of model parameter fitting problems. The new method is proved to be convergent … Read more

Fair Distributional Reinforcement Learning

Published: 2025/01/07

Zequn Chen

Data Science Algorithms, Data Science Applications


A Generalized Voting Game for Categorical Network Choices

Published: 2024/12/03

Lin Yueh

Stefano Nasini

Martine Labbé

Combinatorial Optimization, Data Science Algorithms, Game Theory Categorical regression, Combinatorial games, combinatorial optimization, Network influence

This paper presents a game-theoretical framework for data classification and network discovery, focusing on pairwise influences in multivariate choices. The framework consists of two complementary games in which individuals, connected through a signed weighted graph, exhibit network similarity. A voting rule captures the influence of an individual’s neighbors, categorized as attractive (friend-like) or repulsive (enemy-like), … Read more

Forecasting Outside the Box: Application-Driven Optimal Pointwise Forecasts for Stochastic Optimization

Published: 2024/11/05, Updated: 2024/11/08

Tito Homem-de-Mello

Data Science Algorithms, Stochastic Programming Stochastic optimization; contextual information; machine learning, stochastic programming

The exponential growth in data availability in recent years has led to new formulations of data-driven optimization problems. One such formulation is that of stochastic optimization problems with contextual information, where the goal is to optimize the expected value of a certain function given some contextual information (also called features) that accompany the main data … Read more
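To illustrate why the best point forecast can depend on the downstream optimization problem, here is a minimal sketch using a newsvendor cost. The newsvendor model, the cost parameters, and the demand samples are assumptions made for this illustration and do not come from the paper; the samples stand for demand observed under one fixed context.

```python
def newsvendor_cost(order, demand, underage=3.0, overage=1.0):
    """Cost of ordering `order` units when `demand` units are requested."""
    shortage = max(demand - order, 0.0)
    surplus = max(order - demand, 0.0)
    return underage * shortage + overage * surplus


def expected_cost(order, demands):
    """Average cost of a fixed order over the demand samples."""
    return sum(newsvendor_cost(order, d) for d in demands) / len(demands)


# Hypothetical demand samples for one fixed context (feature value).
demands = [10.0, 12.0, 14.0, 20.0, 24.0]

# Forecast 1: the sample mean, which minimizes squared forecast error.
mean_forecast = sum(demands) / len(demands)

# Forecast 2: the sample point with the lowest average downstream cost.
# The average cost is piecewise linear and convex in the order quantity,
# so searching over the sample points is enough to find a minimizer.
best_order = min(demands, key=lambda q: expected_cost(q, demands))

print(mean_forecast, expected_cost(mean_forecast, demands))
print(best_order, expected_cost(best_order, demands))
```

With shortages costing more than surpluses, the cost-minimizing point lies above the mean, so a forecast tuned for accuracy alone is not the one the application would choose.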

Optimism in the Face of Ambiguity Principle for Multi-Armed Bandits

Published: 2024/10/07, Updated: 2025/02/13

Mengmeng Li

Daniel Kuhn

Bahar Tașkesen

Convex Optimization, Data Science Algorithms, Data Science Applications bandits, discrete choice models, online learning

Follow-The-Regularized-Leader (FTRL) algorithms often enjoy optimal regret for adversarial as well as stochastic bandit problems and allow for a streamlined analysis. However, FTRL algorithms require the solution of an optimization problem in every iteration and are thus computationally challenging. In contrast, Follow-The-Perturbed-Leader (FTPL) algorithms achieve computational efficiency by perturbing the estimates of the rewards of … Read more
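The contrast between the two families can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the method of the paper: the FTRL step uses the negative-entropy regularizer, one of the few cases where the per-round optimization problem has a closed form (a softmax), and the FTPL step uses Gumbel perturbations, for which the induced arm distribution is known to coincide with that softmax. The function names, the learning rate `eta`, and the reward estimates are hypothetical.

```python
import math
import random


def ftrl_entropy_probs(reward_estimates, eta):
    """FTRL over the simplex with negative entropy: closed-form softmax."""
    top = max(reward_estimates)  # shift for numerical stability
    weights = [math.exp(eta * (r - top)) for r in reward_estimates]
    total = sum(weights)
    return [w / total for w in weights]


def gumbel(rng):
    """Standard Gumbel sample, computed as -log(E) with E ~ Exp(1)."""
    e = rng.expovariate(1.0)
    while e == 0.0:  # guard against log(0)
        e = rng.expovariate(1.0)
    return -math.log(e)


def ftpl_arm(reward_estimates, eta, rng):
    """FTPL: perturb each estimate with noise and play the argmax."""
    perturbed = [r + gumbel(rng) / eta for r in reward_estimates]
    return max(range(len(perturbed)), key=lambda i: perturbed[i])


estimates = [1.0, 0.5, 0.0]  # hypothetical cumulative reward estimates
probs = ftrl_entropy_probs(estimates, eta=2.0)

rng = random.Random(0)
draws = [ftpl_arm(estimates, 2.0, rng) for _ in range(5000)]
freqs = [draws.count(i) / len(draws) for i in range(len(estimates))]

print(probs)
print(freqs)
```

FTPL only needs one random draw per arm and an argmax, whereas a general FTRL regularizer would require a numerical solver in each round.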

Forecasting Urban Traffic States with Sparse Data Using Hankel Temporal Matrix Factorization

Published: 2024/08/13

Chun Cheng

Data Science Algorithms, Data Science Applications, Transportation Hankel matrix, machine learning, matrix factorization, traffic state forecasting, Urban transportation network

Forecasting urban traffic states is crucial to transportation network monitoring and management, playing an important role in the decision-making process. Despite the substantial progress that has been made in developing accurate, efficient, and reliable algorithms for traffic forecasting, most existing approaches fail to handle sparsity, high-dimensionality, and nonstationarity in traffic time series and seldom consider … Read more
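The excerpt does not describe how the Hankel structure named in the title is built, so the following is only a generic sketch of the standard construction: a time series is turned into a matrix whose rows are overlapping windows, so that entry (i, j) equals the series value at position i + j. The function name and the `window` parameter are hypothetical, and the paper's construction may differ.

```python
def hankel(series, window):
    """Stack overlapping windows of `series` as rows of a Hankel matrix."""
    rows = len(series) - window + 1
    if window < 1 or rows < 1:
        raise ValueError("window must be between 1 and len(series)")
    # Row i holds series[i], ..., series[i + window - 1], so H[i][j] == series[i + j].
    return [list(series[i:i + window]) for i in range(rows)]


# Hypothetical traffic-speed readings at consecutive time steps.
speeds = [1, 2, 3, 4, 5]
H = hankel(speeds, window=3)
for row in H:
    print(row)
```

Each anti-diagonal of `H` repeats a single observation, which is the redundancy that low-rank factorization methods on Hankel matrices exploit.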

Regularized Gradient Clipping Provably Trains Wide and Deep Neural Networks

Published: 2024/07/19

Anirbit Mukherjee

Data Science Algorithms, Global Optimization Theory, Nonlinear Optimization

In this work, we instantiate a regularized form of the gradient clipping algorithm and prove that it can converge to the global minima of deep neural network loss functions provided that the net is of sufficient width. We present empirical evidence that our theoretically founded regularized gradient clipping algorithm is also competitive with the state-of-the-art … Read more
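For reference, standard gradient clipping by norm can be sketched as follows. The excerpt does not spell out the regularized variant the authors analyze, so this block shows only the unregularized baseline; the function names, learning rate, and threshold are hypothetical.

```python
import math


def clip_by_norm(grad, threshold):
    """Rescale `grad` so that its Euclidean norm is at most `threshold`."""
    norm = math.sqrt(sum(g * g for g in grad))
    if norm <= threshold:
        return list(grad)  # small gradients pass through unchanged
    scale = threshold / norm
    return [g * scale for g in grad]


def clipped_gradient_step(weights, grad, lr, threshold):
    """One gradient descent step using the clipped gradient."""
    clipped = clip_by_norm(grad, threshold)
    return [w - lr * g for w, g in zip(weights, clipped)]


# Hypothetical values: a gradient of norm 5 clipped to norm 1.
weights = [0.0, 0.0]
grad = [3.0, 4.0]
clipped = clip_by_norm(grad, threshold=1.0)
new_weights = clipped_gradient_step(weights, grad, lr=0.1, threshold=1.0)

print(clipped)
print(new_weights)
```

Clipping keeps the direction of the gradient and caps the step length, so the effective step size shrinks whenever the gradient norm exceeds the threshold.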