artificial intelligence – Optimization Online

A Theoretical Framework for Auxiliary-Loss-Free Load Balancing of Sparse Mixture-of-Experts in Large-Scale AI Models

Published: 2025/12/02, Updated: 2025/12/03

Categories: Convex and Nonsmooth Optimization, Optimization in Data Science, Stochastic Programming
Tags: artificial intelligence, auxiliary loss free load balancing, deepseek, load balancing, online convex optimization, online optimization, primal-dual algorithms, sparse mixture of experts

In large-scale AI training, Sparse Mixture-of-Experts (s-MoE) layers enable scaling by activating only a small subset of experts per token. An operational challenge in this design is load balancing: routing tokens to minimize the number of idle experts, which is important for the efficient utilization of (costly) GPUs. We provide a theoretical framework for analyzing …

Ellipsoidal Classification via Semidefinite Programming

Published: 2021/10/29

Categories: Applications - OR and Management Sciences, Semi-definite Programming
Tags: artificial intelligence, classification, semidefinite programming

Separating two finite sets of points in a Euclidean space is a fundamental problem in classification. Customarily, linear separation is used, but nonlinear separators such as spheres have been shown to perform better in some tasks, such as edge detection in images. We exploit the relationships between the more general version of the spherical …

Branch-and-Cut-and-Price for Multi-Agent Pathfinding

Published: 2019/08/20

Categories: 0-1 Programming, Branch and Cut Algorithms, Transportation
Tags: artificial intelligence, column generation, multi-agent, planning, shortest path

There are currently two broad strategies for optimal Multi-Agent Pathfinding (MAPF): (1) search-based methods, which model and solve MAPF directly, and (2) compilation-based solvers, which reduce MAPF to instances of well-known combinatorial problems and thus can benefit from advances in solver techniques. In this work, we present an optimal algorithm, BCP, that hybridizes both approaches …

Automated Tuning of Optimization Software Parameters

Published: 2007/10/29, Updated: 2007/11/13

Categories: Optimization Software and Modeling Systems
Tags: artificial intelligence, design of experiments, machine learning, parameter tuning

We present a method to tune software parameters using ideas from software testing and machine learning. The method is based on the key observation that for many classes of instances, the software shows improved performance if a few critical parameters have “good” values, although which parameters are critical depends on the class of instances. Our …

OR Counterparts to AI Planning

Published: 2000/08/22

Robert Fourer

Categories: Integer Programming, Optimization Software and Modeling Systems
Tags: artificial intelligence, constraint programming, integer programming, operations research, planning

The term Planning is not used in Operations Research in the sense that is most common in Artificial Intelligence. AI Planning does have many features in common with OR scheduling, sequencing, routing, and assignment problems, however. Current approaches to solving such problems can be broadly classified into four areas: Combinatorial Optimization, Integer Programming, Constraint Programming, …