Towards Optimal Offline Reinforcement Learning

We study offline reinforcement learning problems with a long-run average reward objective. The state-action pairs generated by any fixed behavioral policy thus follow a Markov chain, and the empirical state-action-next-state distribution satisfies a large deviations principle. We use the rate function of this large deviations principle to construct an uncertainty set for the unknown true …
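
To give a pointer to the construction, the block below sketches the standard large deviations rate function for the pair (doublet) empirical measure of a finite Markov chain and one way an uncertainty set can be built from it. The symbols, the radius r, and the exact form of the set are illustrative assumptions, not taken from the paper.

```latex
% Doublet empirical measure of the state-action-next-state process after n steps:
\hat{\mu}_n(s,a,s') \;=\; \frac{1}{n}\sum_{t=1}^{n}
  \mathbf{1}\{(s_t,a_t,s_{t+1}) = (s,a,s')\}.

% Rate function of the LDP under a transition kernel P: the conditional relative
% entropy of \mu with respect to its own (s,a)-marginal composed with P:
I_P(\mu) \;=\; \sum_{s,a,s'} \mu(s,a,s') \,\log\frac{\mu(s'\mid s,a)}{P(s'\mid s,a)},
\qquad
\mu(s'\mid s,a) \;=\; \frac{\mu(s,a,s')}{\sum_{s''}\mu(s,a,s'')}.

% One natural uncertainty set for the unknown kernel: all kernels under which the
% observed empirical measure is not too surprising (radius r is an illustrative
% tuning parameter):
\mathcal{U}_r \;=\; \bigl\{\, P \;:\; I_P(\hat{\mu}_n) \le r \,\bigr\}.
```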

Reduced Sample Complexity in Scenario-Based Control System Design via Constraint Scaling

The scenario approach is widely used in robust control system design and chance-constrained optimization because it preserves convexity without requiring assumptions about the probability distribution of the uncertain parameters. However, it can demand large sample sizes, which makes it intractable for safety-critical applications that require very low levels of constraint violation. To address this challenge, we propose a …
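
To make the idea concrete, here is a minimal sketch of a scenario program with a tunable tightening factor on the sampled constraints, written with numpy and cvxpy. The linear problem, the scaling rule b_i / kappa, and the value of kappa are all illustrative assumptions rather than the calibrated procedure the paper proposes.

```python
import numpy as np
import cvxpy as cp

rng = np.random.default_rng(0)

d, N = 5, 200   # decision dimension, number of sampled scenarios
kappa = 1.2     # illustrative tightening factor; kappa = 1 recovers the plain scenario program

c = rng.standard_normal(d)
A = rng.standard_normal((N, d))   # row i is one sampled uncertainty realization a_i
b = 1.0 + rng.random(N)           # sampled right-hand sides, all positive

x = cp.Variable(d)

# Plain scenario program:  a_i^T x <= b_i for every sampled scenario i.
# Scaled version:          a_i^T x <= b_i / kappa.  Since every b_i > 0 here,
# dividing by kappa > 1 shrinks the feasible set, trading extra conservatism
# for a lower constraint violation probability at a fixed sample size N.
constraints = [A @ x <= b / kappa,
               cp.norm_inf(x) <= 10]   # deterministic box keeping the LP bounded

prob = cp.Problem(cp.Minimize(c @ x), constraints)
prob.solve()

print("scaled scenario solution:", x.value)
print("optimal value:", prob.value)
```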

From Data to Decisions: Distributionally Robust Optimization is Optimal

We study stochastic programs where the decision-maker cannot observe the distribution of the exogenous uncertainties but has access to a finite set of independent samples from this distribution. In this setting, the goal is to find a procedure that transforms the data into an estimate of the expected cost function under the unknown data-generating distribution, …
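
The setup can be written compactly. The block below sketches a distributionally robust predictor over a relative-entropy ball centered at the empirical distribution, which is the kind of data-driven procedure the title refers to; the radius r and the finite-support form of the divergence are illustrative assumptions.

```latex
% Data-driven predictor: map the empirical distribution \hat{P}_N of the N samples
% to an estimate \hat{c}(x) of the true expected cost \mathbb{E}_P[\ell(x,\xi)].
%
% Distributionally robust predictor over a relative-entropy ball of radius r
% centered at the empirical distribution (r is a tuning parameter here):
\hat{c}_r(x) \;=\; \sup_{Q \,:\, D(\hat{P}_N \,\|\, Q) \le r}
  \mathbb{E}_Q\bigl[\ell(x,\xi)\bigr],
\qquad
D(P \,\|\, Q) \;=\; \sum_{\xi} P(\xi)\,\log\frac{P(\xi)}{Q(\xi)}.
```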