Peyman Mohajerin Esfahani – Optimization Online

Applications - OR and Management Sciences, Convex and Nonsmooth Optimization, Transportation

Bilge Atasoy

We propose a method for learning decision-makers’ behavior in routing problems using Inverse Optimization (IO). The IO framework falls into the supervised learning category and builds on the premise that the target behavior is an optimizer of an unknown cost function. This cost function is to be learned through historical data, and in the context … Read more

Nonlinear Distributionally Robust Optimization

Published: 2023/06/05

Rayyan

Infinite Dimensional Optimization, Optimization in Data Science, Robust Optimization Frank-Wolfe algorithm, gateaux differentiability, Saddle Point

This article focuses on a class of distributionally robust optimization (DRO) problems where, unlike the growing body of the literature, the objective function is potentially non-linear in the distribution. Existing methods to optimize nonlinear functions in probability space use the Frechet derivatives, which present both theoretical and computational challenges. Motivated by this, we propose an … Read more

Learning in Inverse Optimization: Incenter Cost, Augmented Suboptimality Loss, and Algorithms

Published: 2023/05/12, Updated: 2024/01/23

Pedro Zattoni Scroccaro

Bilge Atasoy

Convex and Nonsmooth Optimization first order algorithms, inverse optimization

In Inverse Optimization (IO), an expert agent solves an optimization problem parametric in an exogenous signal. From a learning perspective, the goal is to learn the expert’s cost function given a dataset of signals and corresponding optimal actions. Motivated by the geometry of the IO set of consistent cost vectors, we introduce the “incenter” concept, … Read more

Bridging Bayesian and Minimax Mean Square Error Estimation via Wasserstein Distributionally Robust Optimization

Published: 2019/11/08, Updated: 2019/11/12

Infinite Dimensional Optimization, Statistics, Stochastic Programming affine estimator, mean square error, wasserstein distance

We introduce a distributionally robust minimium mean square error estimation model with a Wasserstein ambiguity set to recover an unknown signal from a noisy observation. The proposed model can be viewed as a zero-sum game between a statistician choosing an estimator—that is, a measurable function of the observation—and a fictitious adversary choosing a prior—that is, … Read more

Wasserstein Distributionally Robust Optimization: Theory and Applications in Machine Learning

Published: 2019/08/23

Robust Optimization, Stochastic Programming

Many decision problems in science, engineering and economics are affected by uncertain parameters whose distribution is only indirectly observable through samples. The goal of data-driven decision-making is to learn a decision from finitely many training samples that will perform well on unseen test samples. This learning task is difficult even if all training and test … Read more

Wasserstein Distributionally Robust Kalman Filtering

Published: 2018/09/24, Updated: 2018/10/19

Robust Optimization, Statistics, Stochastic Programming distributionally robust optimization, kalman filter, mean square error estimator, wasserstein distance

We study a distributionally robust mean square error estimation problem over a nonconvex Wasserstein ambiguity set containing only normal distributions. We show that the optimal estimator and the least favorable distribution form a Nash equilibrium. Despite the non-convex nature of the ambiguity set, we prove that the estimation problem is equivalent to a tractable convex … Read more

Distributionally Robust Inverse Covariance Estimation: The Wasserstein Shrinkage Estimator

Published: 2018/05/18

Convex Optimization, Robust Optimization distributionally robust optimization, inverse covariance estimation, wasserstein distance

We introduce a distributionally robust maximum likelihood estimation model with a Wasserstein ambiguity set to infer the inverse covariance matrix of a p-dimensional Gaussian random vector from n independent samples. The proposed model minimizes the worst case (maximum) of Stein’s loss across all normal reference distributions within a prescribed Wasserstein distance from the normal distribution … Read more

Regularization via Mass Transportation

Published: 2017/10/26, Updated: 2019/07/12

Robust Optimization, Stochastic Programming distributionally robust optimization, optimal transport, supervised learning

The goal of regression and classification methods in supervised learning is to minimize the empirical risk, that is, the expectation of some loss function quantifying the prediction error under the empirical distribution. When facing scarce training data, overfitting is typically mitigated by adding regularization terms to the objective that penalize hypothesis complexity. In this paper … Read more

From Data to Decisions: Distributionally Robust Optimization is Optimal

Published: 2017/04/12, Updated: 2020/01/07

Bart Paul Gerard Van Parys