reinforcement learning – Optimization Online

AI for Enhancing Operations Research of Agriculture and Energy

Published: 2025/12/29

Categories: (Mixed) Integer Linear Programming, Applications - OR and Management Sciences, Global Optimization

Tags: Agricultural Planning, energy systems, Learning-Enhanced Optimization, mixed-integer nonlinear programming, optimal power flow, reinforcement learning, stochastic optimization, unit commitment

This paper surveys optimization problems arising in agriculture, energy systems, and water-energy coordination from an operations research perspective. These problems are commonly formulated as integer nonlinear programs, mixed-integer nonlinear programs, or combinatorial set optimization models, characterized by nonlinear physical constraints, discrete decisions, and intertemporal coupling. Such structures pose significant computational challenges in large-scale and repeated-solution …

Machine Learning Algorithms for Assisting Solvers for Constraint Satisfaction Problems

Published: 2025/11/09

Morteza Kimiaei

Vyacheslav Kungurtsev

Categories: Applications - OR and Management Sciences, Combinatorial Optimization, Optimization in Data Science

Tags: Boolean Satisfiability, Conflict-Driven Clause Learning, Constraint Satisfaction Problem, Graph Neural Networks, Hybrid Solvers, Lazy Clause Generation, Learned Heuristics, machine learning, Neuro-Symbolic Optimization, reinforcement learning, Transformer Models

This survey proposes a unifying conceptual framework and taxonomy that systematically integrates Machine Learning (ML) and Reinforcement Learning (RL) with classical paradigms for Constraint Satisfaction and Boolean Satisfiability solving. Unlike prior reviews that focus on individual applications, we organize the literature around solver architecture, linking each major phase: constraint propagation, heuristic decision-making, conflict analysis, and meta-level …

Machine Learning Algorithms for Assisting Solvers for Decision Optimization Problems

Published: 2025/11/09

Morteza Kimiaei

Vyacheslav Kungurtsev

Categories: Applications - OR and Management Sciences, Integer Programming, Optimization in Data Science

Tags: backward induction, combinatorial optimization, disjunctive programming, global optimization, machine learning, neural networks, probabilistic programming, reinforcement learning, submodular optimization

Combinatorial decision problems lie at the intersection of Operations Research (OR) and Artificial Intelligence (AI), encompassing structured optimization tasks such as submodular selection, dynamic programming, planning, and scheduling. These problems exhibit exponential growth in decision complexity, driven by interdependent choices coupled through logical, temporal, and resource constraints. Classical optimization frameworks, including integer programming, submodular optimization, and …

Machine Learning Algorithms for Improving Black Box Optimization Solvers

Published: 2025/09/29

Morteza Kimiaei

Vyacheslav Kungurtsev

Categories: Global Optimization, Nonlinear Optimization, Optimization in Data Science

Tags: black-box optimization, machine learning, Meta-Black-Box Optimization, reinforcement learning, robust optimization, surrogate models

Black-box optimization (BBO) addresses problems where objectives are accessible only through costly queries without gradients or explicit structure. Classical derivative-free methods (line search, direct search, and model-based solvers such as Bayesian optimization) form the backbone of BBO, yet often struggle in high-dimensional, noisy, or mixed-integer settings. Recent advances use machine learning (ML) and reinforcement learning (RL) to …

Data-Driven Multistage Scheduling Optimization for Refinery Production under Uncertainty: Systematic Framework, Modeling Approach, and Application Analysis

Published: 2025/05/29

Biao Han

Categories: Control Applications, Scheduling, Stochastic Programming

Tags: control and scheduling, data-driven, julia, multistage, refinery scheduling, reinforcement learning, stochastic programming, uncertainty

The widespread existence of various uncertainties makes the inherently complex refinery production scheduling problem even more challenging. To address this issue, this paper proposes a viable systematic data-driven multistage scheduling optimization framework and develops a corresponding structured modeling methodology. Under this paradigm, unit-level advanced control and plant-level intelligent scheduling are coordinated to jointly deal with …

Towards Optimal Offline Reinforcement Learning

Published: 2025/03/15

Mengmeng Li

Daniel Kuhn

Tobias Sutter

Categories: Robust Optimization

Tags: large deviations theory, reinforcement learning

We study offline reinforcement learning problems with a long-run average reward objective. The state-action pairs generated by any fixed behavioral policy thus follow a Markov chain, and the empirical state-action-next-state distribution satisfies a large deviations principle. We use the rate function of this large deviations principle to construct an uncertainty set for the unknown true …
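As a rough illustration of the empirical state-action-next-state distribution mentioned in this abstract, the following Python sketch counts (s, a, s') triples along a single trajectory and normalizes the counts. The function name and the toy trajectory are hypothetical and are not taken from the paper; the uncertainty set built from the large deviations rate function is not shown.

```python
from collections import Counter

def empirical_sas_distribution(states, actions):
    """Return the empirical distribution over (s, a, s') triples.

    `states` has one more entry than `actions`: states[t + 1] is the state
    reached after taking actions[t] in states[t].
    """
    if len(states) != len(actions) + 1:
        raise ValueError("need len(states) == len(actions) + 1")
    if not actions:
        raise ValueError("trajectory must contain at least one transition")
    # Count each observed transition triple along the trajectory.
    counts = Counter(zip(states[:-1], actions, states[1:]))
    total = len(actions)
    return {triple: n / total for triple, n in counts.items()}

# Hypothetical toy trajectory generated by some fixed behavioral policy.
states = [0, 1, 0, 1, 1]
actions = ["a", "b", "a", "a"]
dist = empirical_sas_distribution(states, actions)
```

Here `dist` maps each observed triple to its relative frequency, so the values sum to one.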

From Optimization to Control: Quasi Policy Iteration

Published: 2023/11/27, Updated: 2025/08/26

Mohamad Amin Sharifi Kolarijani

Peyman Mohajerin Esfahani

Categories: Convex Optimization, Dynamic Programming

Tags: dynamic programming, markov decision processes, optimization algorithms, quasi-newton methods, reinforcement learning

Recent control algorithms for Markov decision processes (MDPs) have been designed using an implicit analogy with well-established optimization algorithms. In this paper, we adopt the quasi-Newton method (QNM) from convex optimization to introduce a novel control algorithm coined as quasi-policy iteration (QPI). In particular, QPI is based on a novel approximation of the “Hessian” matrix …

Dynamic courier capacity acquisition in rapid delivery systems: a deep Q-learning approach

Published: 2022/01/25

Ramon Auad

Alan L. Erera

Martin Savelsbergh

Categories: Applications - OR and Management Sciences, Production and Logistics, Transportation

Tags: capacity management, Deep Q-learning, last-mile delivery, logistics, Rapid delivery, reinforcement learning

With the recent boom of the gig economy, urban delivery systems have experienced substantial demand growth. In such systems, orders are delivered to customers from local distribution points respecting a delivery time promise. An important example is a restaurant meal delivery system, where delivery times are expected to be minutes after an order is placed. …

Batch Learning in Stochastic Dual Dynamic Programming

Published: 2021/05/17

Daniel Ávila

Nils Löhndorf

Anthony Papavasiliou

Categories: Dynamic Programming, Stochastic Programming

Tags: dynamic programming, parallel computing, reinforcement learning, sddp, stochastic programming

We consider the stochastic dual dynamic programming (SDDP) algorithm, which is a widely employed algorithm applied to multistage stochastic programming, and propose a variant using batch learning, a technique used with success in the reinforcement learning framework. We cast SDDP as a type of Q-learning algorithm and describe its application in both risk neutral and …

An Adaptive and Near Parameter-free BRKGA Using Q-Learning Method

Published: 2021/02/22, Updated: 2021/04/21

Antonio Chaves

Luiz Henrique Lorena

Categories: Meta Heuristics

Tags: genetic algorithm, parameter control, q-learning, reinforcement learning

The Biased Random-Key Genetic Algorithm (BRKGA) is an efficient metaheuristic to solve combinatorial optimization problems but requires parameter tuning so the intensification and diversification of the algorithm work in a balanced way. There is, however, not only one optimal parameter configuration, and the best configuration may differ according to the stages of the evolutionary process. …
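One way to picture Q-learning-based parameter control is a stateless, epsilon-greedy value update over a small set of candidate parameter values. The Python sketch below is a generic illustration under that assumption, not the authors' algorithm: the candidate values, the `toy_reward` function, and all names are hypothetical stand-ins for running a BRKGA generation with a given setting and measuring the improvement it produces.

```python
import random

def q_learning_parameter_control(candidates, reward_fn, generations=2000,
                                 alpha=0.1, epsilon=0.2, seed=0):
    """Learn one value estimate per candidate setting (stateless Q-learning)."""
    rng = random.Random(seed)
    q = {c: 0.0 for c in candidates}
    for _ in range(generations):
        if rng.random() < epsilon:
            choice = rng.choice(candidates)      # explore a random setting
        else:
            choice = max(candidates, key=q.get)  # exploit the best estimate
        reward = reward_fn(choice)
        # Move the estimate a step of size alpha toward the observed reward.
        q[choice] += alpha * (reward - q[choice])
    return q

def toy_reward(value):
    # Hypothetical reward: highest when the parameter equals 0.3.
    # In practice this would be a noisy measure of solution improvement.
    return 1.0 - abs(value - 0.3)

candidates = [0.1, 0.2, 0.3, 0.4]
q_values = q_learning_parameter_control(candidates, toy_reward)
best = max(q_values, key=q_values.get)
```

Because the abstract notes that the best configuration may change across stages of the evolutionary process, a constant step size `alpha` is used here so that recent rewards keep influencing the estimates; this is a design assumption of the sketch.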