continuous state space – Optimization Online

A Q-Learning Algorithm with Continuous State Space

Published: 2006/09/23

Kengy Barty
Pierre Girardeau
Jean-Sebastien Roy
Cyrille Strugarek

Dynamic Programming, Stochastic Programming continuous state space, kernels, q-learning

We study in this paper a Markov Decision Problem (MDP) with continuous state space and discrete decision variables. We propose an extension of the Q-learning algorithm introduced to solve this problem by Watkins in 1989 for completely discrete MDPs. Our algorithm relies on stochastic approximation and functional estimation, and uses kernels to locally update the … Read more