policy search – Optimization Online

Reinforcement Learning via Parametric Cost Function Approximation for Multistage Stochastic Programming

Published: 2018/12/27, Updated: 2019/10/06

Categories: Applications - OR and Management Sciences, Optimization of Simulated Systems, Stochastic Programming
Tags: parametric cost function approximation, policy search, simulation-based optimization, stochastic optimization, stochastic programming

The most common approaches for solving stochastic resource allocation problems in the research literature are to use either value functions (“dynamic programming”) or scenario trees (“stochastic programming”) to approximate the impact of a decision now on the future. By contrast, common industry practice is to use a deterministic approximation of the future, which is easier …