Pierre Girardeau – Optimization Online

On the convergence of decomposition methods for multi-stage stochastic convex programs

Published: 2012/04/24, Updated: 2013/05/25

Convex Optimization, Dynamic Programming, Stochastic Programming benders decomposition, dynamic programming, monte carlo sampling, stochastic dual dynamic programming algorithm, stochastic programming

We prove the almost-sure convergence of a class of sampling-based nested decomposition algorithms for multistage stochastic convex programs in which the stage costs are general convex functions of the decisions, and uncertainty is modelled by a scenario tree. As special cases, our results imply the almost-sure convergence of SDDP, CUPPS and DOASA when applied to … Read more

A comparison of sample-based Stochastic Optimal Control methods

Published: 2010/02/10

Pierre Girardeau

Stochastic Programming particle methods, stochastic optimal control, stochastic programming

In this paper, we compare the performance of two scenario-based numerical methods to solve stochastic optimal control problems: scenario trees and particles. The problem consists in finding strategies to control a dynamical system perturbed by exogenous noises so as to minimize some expected cost along a discrete and finite time horizon. We introduce the Mean … Read more

Decomposition of large-scale stochastic optimal control problems

Published: 2009/02/12, Updated: 2009/03/06

Kengy Barty

Pierre Carpentier

Pierre Girardeau

Dynamic Programming, Stochastic Programming decomposition, dynamic programming, stochastic optimal control

In this paper, we present an Uzawa-based heuristic that is adapted to some type of stochastic optimal control problems. More precisely, we consider dynamical systems that can be divided into small-scale independent subsystems, though linked through a static almost sure coupling constraint at each time step. This type of problem is common in production/portfolio management … Read more

A Q-Learning Algorithm with Continuous State Space

Published: 2006/09/23

Dynamic Programming, Stochastic Programming continuous state space, kernels, q-learning

We study in this paper a Markov Decision Problem (MDP) with continuous state space and discrete decision variables. We propose an extension of the Q-learning algorithm introduced to solve this problem by Watkins in 1989 for completely discrete MDPs. Our algorithm relies on stochastic approximation and functional estimation, and uses kernels to locally update the … Read more