Optimization in Data Science – Page 2

Guaranteed bounds for optimal stopping problems using kernel-based non-asymptotic uniform confidence bands

Published: 2024/11/22, Updated: 2025/03/18

Dynamic Programming, Optimization in Data Science, Stochastic Programming approximate dynamic programming, Finite sample guarantees, optimal stopping, Reproducing kernel Hilbert spaces, stochastic programming

In this paper, we introduce an approach for obtaining probabilistically guaranteed upper and lower bounds on the true optimal value of stopping problems. Bounds of existing simulation-and-regression approaches, such as those based on least squares Monte Carlo and information relaxation, are stochastic in nature and therefore do not come with a finite sample guarantee. Our … Read more

Decision-focused predictions via pessimistic bilevel optimization: complexity and algorithms

Published: 2024/11/22

Constrained Nonlinear Optimization, Data Science Applications, Other Topics

Dealing with uncertainty in optimization parameters is an important and longstanding challenge. Typically, uncertain parameters are predicted accurately, and then a deterministic optimization problem is solved. However, the decisions produced by this so-called predict-then-optimize procedure can be highly sensitive to uncertain parameters. In this work, we contribute to recent efforts in producing decision-focused predictions, i.e., to … Read more

Forecasting Outside the Box: Application-Driven Optimal Pointwise Forecasts for Stochastic Optimization

Published: 2024/11/05, Updated: 2024/11/08

Tito Homem-de-Mello

Data Science Algorithms, Stochastic Programming Stochastic optimization; contextual information; machine learning, stochastic programming

The exponential growth in data availability in recent years has led to new formulations of data-driven optimization problems. One such formulation is that of stochastic optimization problems with contextual information, where the goal is to optimize the expected value of a certain function given some contextual information (also called features) that accompany the main data … Read more

Missing Value Imputation via Mathematical Optimization with Instance-and-Feature Neighborhoods

Published: 2024/10/25, Updated: 2024/10/28

Yuichi Takano

(Mixed) Integer Nonlinear Programming, Optimization in Data Science Alternating Optimization, mathematical optimization, missing value imputation, nearest neighbors, warm-start

Datasets collected for analysis often contain a certain amount of incomplete instances, where some feature values are missing. Since many statistical analyses and machine learning algorithms depend on complete datasets, missing values need to be imputed in advance. Bertsimas et al. (2018) proposed a high-performance method that combines machine learning and mathematical optimization algorithms for … Read more

A Generalization Result for Convergence in Learning-to-Optimize

Published: 2024/10/10

Michael Sucker

Peter Ochs

Convex and Nonsmooth Optimization, Nonsmooth Optimization, Optimization in Data Science learning-to-optimize, pac-bayes, stationary points

Convergence in learning-to-optimize is hardly studied, because conventional convergence guarantees in optimization are based on geometric arguments, which cannot be applied easily to learned algorithms. Thus, we develop a probabilistic framework that resembles deterministic optimization and allows for transferring geometric arguments into learning-to-optimize. Our main theorem is a generalization result for parametric classes of potentially … Read more

Optimism in the Face of Ambiguity Principle for Multi-Armed Bandits

Published: 2024/10/07, Updated: 2025/02/13

Mengmeng Li

Daniel Kuhn

Bahar Tașkesen

Convex Optimization, Data Science Algorithms, Data Science Applications bandits, discrete choice models, online learning

Follow-The-Regularized-Leader (FTRL) algorithms often enjoy optimal regret for adversarial as well as stochastic bandit problems and allow for a streamlined analysis. However, FTRL algorithms require the solution of an optimization problem in every iteration and are thus computationally challenging. In contrast, Follow-The-Perturbed-Leader (FTPL) algorithms achieve computational efficiency by perturbing the estimates of the rewards of … Read more

Single-Loop Deterministic and Stochastic Interior-Point Algorithms for Nonlinearly Constrained Optimization

Published: 2024/08/28

Frank E. Curtis

Qi Wang

Xin Jiang

Constrained Nonlinear Optimization, Nonlinear Optimization, Optimization in Data Science

An interior-point algorithm framework is proposed, analyzed, and tested for solving nonlinearly constrained continuous optimization problems. The main setting of interest is when the objective and constraint functions may be nonlinear and/or nonconvex, and when constraint values and derivatives are tractable to compute, but objective function values and derivatives can only be estimated. The algorithm … Read more

A Markovian Model for Learning-to-Optimize

Published: 2024/08/21

Michael Sucker

Peter Ochs

Optimization in Data Science, Stochastic Approaches, Stochastic Programming convergence rate, learning-to-optimize, pac-bayes, stochastic processes, stopping time

We present a probabilistic model for stochastic iterative algorithms with the use case of optimization algorithms in mind. Based on this model, we present PAC-Bayesian generalization bounds for functions that are defined on the trajectory of the learned algorithm, for example, the expected (non-asymptotic) convergence rate and the expected time to reach the stopping criterion. … Read more

Forecasting Urban Traffic States with Sparse Data Using Hankel Temporal Matrix Factorization

Published: 2024/08/13

Chun Cheng

Data Science Algorithms, Data Science Applications, Transportation hankel matrix, machine learning, matrix factorization, traffic state forecasting, Urban transportation network

Forecasting urban traffic states is crucial to transportation network monitoring and management, playing an important role in the decision-making process. Despite the substantial progress that has been made in developing accurate, efficient, and reliable algorithms for traffic forecasting, most existing approaches fail to handle sparsity, high-dimensionality, and nonstationarity in traffic time series and seldom consider … Read more

Regularized Gradient Clipping Provably Trains Wide and Deep Neural Networks

Published: 2024/07/19

Anirbit Mukherjee

Data Science Algorithms, Global Optimization Theory, Nonlinear Optimization

In this work, we instantiate a regularized form of the gradient clipping algorithm and prove that it can converge to the global minima of deep neural network loss functions provided that the net is of sufficient width. We present empirical evidence that our theoretically founded regularized gradient clipping algorithm is also competitive with the state-of-the-art … Read more