markov decision processes – Page 3

From Infinite to Finite Programs: Explicit Error Bounds with Applications to Approximate Dynamic Programming

Published: 2017/02/20

We consider linear programming (LP) problems in infinite dimensional spaces that are in general computationally intractable. Under suitable assumptions, we develop an approximation bridge from the infinite-dimensional LP to tractable finite convex programs in which the performance of the approximation is quantified explicitly. To this end, we adopt the recent developments in two areas of … Read more

Controlled Markov Decision Processes with AVaR Criteria for Unbounded Costs

Published: 2016/11/27

Kerem Ugurlu

Applications - OR and Management Sciences average value at risk, markov decision processes, optimal control

In this paper, we consider the control problem with the Average-Value-at-Risk (AVaR) criteria of the possibly unbounded L 1 -costs in infinite horizon on a Markov Decision Process (MDP). With a suitable state aggregation and by choosing a priori a global variable s heuristically, we show that there exist optimal policies for the infinite horizon … Read more

Distributionally robust inventory control when demand is a martingale

Published: 2015/11/30, Updated: 2018/08/20

David A. Goldberg

Linwei Xin

Applications - OR and Management Sciences, Robust Optimization, Stochastic Programming demand forecasting, distributionally robust optimization, dynamic programming, inventory control, markov decision processes, martingale

Demand forecasting plays an important role in many inventory control problems. To mitigate the potential harms of model misspecification in this context, various forms of distributionally robust optimization have been applied. Although many of these methodologies suffer from the problem of time-inconsistency, the work of Klabjan, Simchi-Levi and Song [85] established a general time-consistent framework … Read more

Controlled Markov Chains with AVaR Criteria for Unbounded Costs

Published: 2015/09/27, Updated: 2016/03/29

Kerem Ugurlu

Applications - OR and Management Sciences average value at risk, markov decision processes, optimal control

In this paper, we consider the control problem with the Average-Value-at-Risk (AVaR) criteria of the possibly unbounded $L^{1}$-costs in infinite horizon on a Markov Decision Process (MDP). With a suitable state aggregation and by choosing a priori a global variable $s$ heuristically, we show that there exist optimal policies for the infinite horizon problem. To … Read more

Rectangular sets of probability measures

Published: 2014/10/24, Updated: 2015/10/13

Alexander Shapiro

Stochastic Programming coherent risk measures, dynamic programming, markov decision processes, multistage stochastic optimization, rectangularity, risk averse optimization, time consistency

In this paper we consider the notion of rectangularity of a set of probability measures, introduced in Epstein and Schneider (2003), from a somewhat different point of view. We define rectangularity as a property of dynamic decomposition of a distributionally robust stochastic optimization problem and show how it relates to the modern theory of coherent … Read more

Information Relaxation Bounds for Infinite Horizon Markov Decision Processes

Published: 2014/09/06, Updated: 2016/12/30

David B. Brown

Martin Haugh

Dynamic Programming, Scheduling, Stochastic Programming dynamic programming, markov decision processes, multiclass queues

We consider the information relaxation approach for calculating performance bounds for stochastic dynamic programs (DPs), following Brown, Smith, and Sun (2010). This approach generates performance bounds by solving problems with relaxed nonanticipativity constraints and a penalty that punishes violations of these constraints. In this paper, we study infinite horizon DPs with discounted costs and consider … Read more

Singularly Perturbed Markov Decision Processes: A Multiresolution Algorithm

Published: 2013/11/06, Updated: 2014/06/17

Chin Pang Ho

Panos Parpas

Dynamic Programming markov decision processes, multigrid methods, multiscale modeling, weak and strong interactions

Singular perturbation techniques allow the derivation of an aggregate model whose solution is asymptotically optimal for Markov Decision Processes with strong and weak interactions. We develop an algorithm that takes advantage of the asymptotic optimality of the aggregate model in order to compute the solution of the original model with theoretically better complexity than conventional … Read more

Interdiction Games on Markovian PERT Networks

Published: 2013/04/15, Updated: 2014/03/31

Eli Gutin

Daniel Kuhn

Wolfram Wiesemann

Stochastic Programming interdiction game, markov decision processes, pert network, robust optimization

In a stochastic interdiction game a proliferator aims to minimize the expected duration of a nuclear weapons development project, while an interdictor endeavors to maximize the project duration by delaying some of the project tasks. We formulate static and dynamic versions of the interdictor’s decision problem where the interdiction plan is either pre-committed or adapts … Read more

Optimizing Trading Decisions for Hydro Storage Systems using Approximate Dual Dynamic Programming

Published: 2011/12/10, Updated: 2012/09/19

Nils Löhndorf

David Wozabal

Stefan Minner

Applications - OR and Management Sciences, Dynamic Programming, Stochastic Programming approximate dynamic programming, markov decision processes, or in energy, stochastic programming

We propose a new approach to optimize operations of hydro storage systems with multiple connected reservoirs which participate in wholesale electricity markets. Our formulation integrates short-term intraday with long-term interday decisions. The intraday problem considers bidding decisions as well as storage operation during the day and is formulated as a stochastic program. The interday problem … Read more

A New Complexity Result on Solving the Markov Decision Problem

Published: 2004/10/05

Yinyu Ye

Linear Programming dynamic programming, linear programming, markov decision processes

We present a new complexity result on solving the Markov decision problem (MDP) with $n$ states and a number of actions for each state, a special class of real-number linear programs with the Leontief matrix structure. We prove that, when the discount factor $\theta$ is strictly less than $1$, the problem can be solved in … Read more