Dynamic Programming – Page 14 – Optimization Online

MIDAS: A Mixed Integer Dynamic Approximation Scheme

Published: 2016/05/07, Updated: 2016/06/15

Mixed Integer Dynamic Approximation Scheme (MIDAS) is a new sampling-based algorithm for solving finite-horizon stochastic dynamic programs with monotonic Bellman functions. MIDAS approximates these value functions using step functions, leading to stage problems that are mixed integer programs. We provide a general description of MIDAS, and prove its almost-sure convergence to an epsilon-optimal policy when … Read more

A Polyhedral Approach to Online Bipartite Matching

Published: 2016/04/11, Updated: 2017/07/12

Shabbir Ahmed

Alejandro Toriello

Alfredo Torrico

Dynamic Programming, Polyhedra dynamic programming, online matching, polyhedral relaxation

We study the i.i.d. online bipartite matching problem, a dynamic version of the classical model where one side of the bipartition is fixed and known in advance, while nodes from the other side appear one at a time as i.i.d. realizations of a uniform distribution, and must immediately be matched or discarded. We consider various … Read more

A joint routing and speed optimization problem

Published: 2016/02/26, Updated: 2017/03/05

(Mixed) Integer Nonlinear Programming, Dynamic Programming, Transportation

Fuel cost contributes to a significant portion of operating cost in cargo transportation. Though classic routing models usually treat fuel cost as input data, fuel consumption heavily depends on the travel speed, which has led to the study of optimizing speeds over a given fixed route. In this paper, we propose a joint routing and … Read more

A Deterministic Fully Polynomial Time Approximation Scheme For Counting Integer Knapsack Solutions Made Easy

Published: 2015/12/03, Updated: 2016/07/03

Nir Halman

Approximation Algorithms, Dynamic Programming approximate counting, dynamic programming, integer knapsack

Given $n$ elements with nonnegative integer weights $w=(w_1,\ldots,w_n)$, an integer capacity $C$ and positive integer ranges $u=(u_1,\ldots,u_n)$, we consider the counting version of the classic integer knapsack problem: find the number of distinct multisets whose weights add up to at most $C$. We give a deterministic algorithm that estimates the number of solutions to within … Read more

The Budgeted Minimum Cost Flow Problem with Unit Upgrading Cost

Published: 2015/10/27

Combinatorial Optimization, Dynamic Programming, Network Optimization bilinear problem, budgeted optimization, complexity, minimum cost ow

The budgeted minimum cost flow problem (BMCF(K)) with unit upgrading costs extends the classical minimum cost flow problem by allowing to reduce the cost of at most K arcs. In this paper, we consider complexity and algorithms for the special case of an uncapacitated network with just one source. By a reduction from 3-SAT we … Read more

A Quantitative Comparison of Risk Measures

Published: 2015/09/14

Alois Pichler

Applications - OR and Management Sciences, Dynamic Programming, Stochastic Programming dual representation, risk measures

The choice of a risk measure reflects a subjective preference of the decision maker in many managerial, or real world economic problem formulations. To evaluate the impact of personal preferences it is thus of interest to have comparisons with other risk measures at hand. This paper develops a framework for comparing different risk measures. We … Read more

Nonstationary Direct Policy Search for Risk-Averse Stochastic Optimization

Published: 2015/09/12, Updated: 2017/05/06

Belgacem Bouzaiene-Ayari

Somayeh Moazeni

Boris Defourny

Warren B. Powell

Dynamic Programming, Parallel Algorithms, Stochastic Programming cost function approximation (cfa) policy, dynamic programming, stochastic optimization

This paper presents an approach to non-stationary policy search for finite-horizon, discrete-time Markovian decision problems with large state spaces, constrained action sets, and a risk-sensitive optimality criterion. The methodology relies on modeling time variant policy parameters by a non-parametric response surface model for an indirect parametrized policy motivated by the Bellman equation. Through the interpolating … Read more

Semi-Infinite Relaxations for the Dynamic Knapsack Problem with Stochastic Item Sizes

Published: 2015/08/20

Daniel Blado

Weihong Hu

Alejandro Toriello

0-1 Programming, Dynamic Programming dynamic programming, relaxation, stochastic knapsack

We consider a version of the knapsack problem in which an item size is random and revealed only when the decision maker attempts to insert it. After every successful insertion the decision maker can choose the next item dynamically based on the remaining capacity and available items, while an unsuccessful insertion terminates the process. We … Read more

Fully Polynomial Time hBcApproximation Schemes for Continuous Stochastic Convex Dynamic Programs

Published: 2015/07/07

Nir Halman

Giacomo Nannicini

Approximation Algorithms, Dynamic Programming approximation algorithms, hBcapproximation sets and functions, stochastic dynamic programming

We develop fully polynomial time $(\Sigma,\Pi)$-approximation schemes for stochastic dynamic programs with continuous state and action spaces, when the single-period cost functions are convex Lipschitz-continuous functions that are accessed via value oracle calls. That is, for every given additive error parameter $\Sigma>0$ and multiplicative error factor $\Pi=1+\epsilon>1$, the scheme returns a feasible solution whose value … Read more

Provably Near-Optimal Approximation Schemes for Implicit Stochastic and for Sample-Based Dynamic Programs

Published: 2015/06/08, Updated: 2023/09/01

Nir Halman

Dynamic Programming approximation algorithms, hBcapproximation sets and functions, inventory control, sample average approximation

In this paper we address two models of non-deterministic discrete-time finite-horizon dynamic programs (DPs): implicit stochastic DPs – the information about the random events is given by value oracles to their CDFs; and sample-based DPs – the information about the random events is deduced via samples. In both models the single period cost functions are … Read more