approximate dynamic programming – Optimization Online

Guaranteed bounds for optimal stopping problems using kernel-based non-asymptotic uniform confidence bands

Published: 2024/11/22, Updated: 2025/03/18

Dynamic Programming, Optimization in Data Science, Stochastic Programming approximate dynamic programming, Finite sample guarantees, optimal stopping, Reproducing kernel Hilbert spaces, stochastic programming

In this paper, we introduce an approach for obtaining probabilistically guaranteed upper and lower bounds on the true optimal value of stopping problems. Bounds of existing simulation-and-regression approaches, such as those based on least squares Monte Carlo and information relaxation, are stochastic in nature and therefore do not come with a finite sample guarantee. Our … Read more

Multistage Stochastic Facility Location under Facility Disruption Uncertainty

Published: 2024/05/13

Bonn Kleiford Seranilla

Nils Löhndorf

Production and Logistics, Stochastic Programming, Supply Chain Management approximate dynamic programming, Multistage Stochastic Facility Location, Shadow Price Approximation, stochastic dual dynamic integer programming

We consider a multistage variant of the classical stochastic capacitated facility location problem under facility disruption uncertainty. Two solution algorithms for this problem class are presented: (1) stochastic dual dynamic integer programming (SDDiP), the state-of-the-art algorithm for solving multistage stochastic integer programs, and (2) shadow price approximation (SPA), an algorithm utilizing trained parameters of the … Read more

Optimizing Vaccine Distribution in Developing Countries under Natural Disaster Risk

Published: 2022/09/18, Updated: 2022/12/01

Bonn Kleiford Seranilla

Nils Löhndorf

(Mixed) Integer Linear Programming, Dynamic Programming, Stochastic Programming approximate dynamic programming, Healthcare Facility Location Problem, humanitarian logistics, stochastic programming

For many developing countries, COVID-19 vaccination roll-out programs are not only slow but vaccination centers are also exposed to the risk of natural disaster, like flooding, which may slow down vaccination progress even further. Policy-makers in developing countries therefore seek to implement strategies that hedge against distribution risk in order for vaccination campaigns to run … Read more

Randomized Policy Optimization for Optimal Stopping

Published: 2022/03/25

Xinyi Guan

Velibor Mišić

Dynamic Programming, Finance and Economics, Unconstrained Optimization approximate dynamic programming, non-convex optimization, optimal stopping, option pricing, randomization

Optimal stopping is the problem of determining when to stop a stochastic system in order to maximize reward, which is of practical importance in domains such as finance, operations management and healthcare. Existing methods for high-dimensional optimal stopping that are popular in practice produce deterministic linear policies — policies that deterministically stop based on the … Read more

Approximate Dynamic Programming for Crowd-shipping with In-store Customers

Published: 2021/09/27, Updated: 2021/11/18

Civil and Environmental Engineering, Dynamic Programming, Transportation approximate dynamic programming, crowd-shipping, last-mile delivery, markov decision processes, value function approximation

Crowd-shipping has gained significant attention as a last-mile delivery option over the recent years. In this study, we propose a variant of dynamic crowd-shipping model with in-store customers as crowd-shippers to deliver online orders within few hours. We formulate the problem as a Markov decision process and develop an approximate dynamic programming (ADP) policy using … Read more

Random-Sampling Multipath Hypothesis Propagation for Cost Approximation in Long-Horizon Optimal Control

Published: 2020/02/09, Updated: 2020/07/03

Hans D Mittelmann

Shankarachary Ragi

Applications - Science and Engineering, Control Applications approximate dynamic programming, cost approximation, long horizon optimal control, multipath hypothesis propagation

In this paper, we develop a Monte-Carlo based heuristic approach to approximate the objective function in long horizon optimal control problems. In this approach, we evolve the system state over multiple trajectories into the future while sampling the noise disturbances at each time-step, and find the weighted average of the costs along all the trajectories. … Read more

Dynamic optimization for airline maintenance operations

Published: 2019/03/07

Felipe Delgado

Mathias A. Klapp

Carlos Lagos

Airline Optimization airline maintenance, approximate dynamic programming, tail assignment, task scheduling

The occurrence of unexpected aircraft maintenance tasks can produce expensive changes in an airline’s operation. When it comes to critical tasks, it might even cancel programmed flights. Despite of it, the challenge of scheduling aircraft maintenance operations under uncertainty has received limited attention in the scientific literature. We study a dynamic airline maintenance scheduling problem, … Read more

A Dynamic Mobile Production Capacity and Inventory Control Problem

Published: 2018/11/26

Alan L. Erera

Satya S. Malladi

Chelsea C. White III

Applications - OR and Management Sciences, Dynamic Programming, Production and Logistics approximate dynamic programming, joint inventory control and capacity logistics, mobile modular production, rollout heuristic

We analyze a problem of dynamic logistics planning given uncertain demands for a multi-location production-inventory system with transportable modular production capacity. In such systems, production modules provide capacity, and can be moved from one location to another to produce stock and satisfy demand. We formulate a dynamic programming model for a planning problem that considers … Read more

Revisiting Approximate Linear Programming Using a Saddle Point Approach

Published: 2017/06/11, Updated: 2018/06/11

Qihang Lin

Selvaprabu Nadarajah

Negar Soheili

Dynamic Programming, Semi-infinite Programming, Stochastic Programming approximate dynamic programming, approximate linear programming, energy storage, first-order methods, inventory control, markov decision processes

Approximate linear programs (ALPs) are well-known models for computing value function approximations (VFAs) of intractable Markov decision processes (MDPs) arising in applications. VFAs from ALPs have desirable theoretical properties, define an operating policy, and provide a lower bound on the optimal policy cost, which can be used to assess the suboptimality of heuristic policies. However, … Read more