Convex and Nonsmooth Optimization – Optimization Online

Data-Dependent Complexity of First-Order Methods for Binary Classification

Published: 2025/12/04

Convex Optimization, Optimization in Data Science accelerated first-order methods, binary classification, convergence complexity, convex optimization

Large-scale problems in data science are often modeled with optimization, and the optimization model is usually solved with first-order methods that may converge at a sublinear rate. Therefore, it is of interest to terminate the optimization algorithm as soon as the underlying data science task is accomplished. We consider FISTA for solving two binary classification … Read more

A Theoretical Framework for Auxiliary-Loss-Free Load Balancing of Sparse Mixture-of-Experts in Large-Scale AI Models

Published: 2025/12/02, Updated: 2025/12/03

X.Y. Han

Yuan Zhong

Convex and Nonsmooth Optimization, Optimization in Data Science, Stochastic Programming artificial intelligence, auxiliary loss free load balancing, deepseek, load balancing, online convex optimization, online optimization, primal-dual algorithms, sparse mixture of experts

In large-scale AI training, Sparse Mixture-of-Experts (s-MoE) layers enable scaling by activating only a small subset of experts per token. An operational challenge in this design is load balancing: routing tokens to minimize the number of idle experts, which is important for the efficient utilization of (costly) GPUs. We provide a theoretical framework for analyzing … Read more

Sparse Multiple Kernel Learning: Alternating Best Response and Semidefinite Relaxations

Published: 2025/12/01

Dimitris Bertsimas

Caio de Próspero Iglesias

Nicholas A. G. Johnson

Convex Optimization, Semi-definite Programming, Statistics alternating best response, classification in machine learning, convex relaxation, kernel, sparsity

We study Sparse Multiple Kernel Learning (SMKL), which is the problem of selecting a sparse convex combination of prespecified kernels for support vector binary classification. Unlike prevailing $\ell_1$‐regularized approaches that approximate a sparsifying penalty, we formulate the problem by imposing an explicit cardinality constraint on the kernel weights and add an $\ell_2$ penalty for robustness. … Read more

Subgame Perfect Methods in Nonsmooth Convex Optimization

Published: 2025/12/01

Benjamin Grimmer

Alex L. Wang

Convex Optimization, Nonsmooth Optimization

This paper considers nonsmooth convex optimization with either a subgradient or proximal operator oracle. In both settings, we identify algorithms that achieve the recently introduced game-theoretic optimality notion for algorithms known as subgame perfection. Subgame perfect algorithms meet a more stringent requirement than just minimax optimality. Not only must they provide optimal uniform guarantees on … Read more

On the Convergence of Constrained Gradient Method

Published: 2025/11/21

Constrained Nonlinear Optimization, Convex Optimization constrained optimization, variational inequality problem

The constrained gradient method (CGM) has recently been proposed to solve convex optimization and monotone variational inequality (VI) problems with general functional constraints. While existing literature has established convergence results for CGM, the assumptions employed therein are quite restrictive; in some cases, certain assumptions are mutually inconsistent, leading to gaps in the underlying analysis. This … Read more

Supermodularity, Curvature, and Convex Relaxations in a Class of Quadratic Binary Optimization Problems

Published: 2025/11/20

Bismark Singh

Combinatorial Optimization, Convex and Nonsmooth Optimization convex relaxation, Quadratic optimization, supermodularity, total curvature

We study the combinatorial structure of a quadratic set function $F(S)$ arising from a class of binary optimization models within the family of undesirable facility location problems. Despite strong empirical evidence of nested optimal solutions in previously studied real-world instances, we show that $F(S)$ is, in general, neither submodular nor supermodular. To quantify deviation from … Read more

Min-Max Optimization Is Strictly Easier Than Variational Inequalities

Published: 2025/11/14

Henry Shugart

Jason M. Altschuler

Convex Optimization

Classically, a mainstream approach for solving a convex-concave min-max problem is to instead solve the variational inequality problem arising from its first-order optimality conditions. Is it possible to solve min-max problems faster by bypassing this reduction? This paper initiates this investigation. We show that the answer is yes in the textbook setting of unconstrained quadratic … Read more

A One-Extra Player Reduction of GNEPs to NEPs

Published: 2025/11/12

Henri Lefebvre

David Salas

Martin Schmidt

Convex Optimization, Game Theory, Integer Programming generalized nash equilibrium problems, nash equilibrium problems, penalization, reformulation, Variational inequalities

It is common opinion that generalized Nash equilibrium problems are harder than Nash equilibrium problems. In this work, we show that by adding a new player, it is possible to reduce many generalized problems to standard equilibrium problems. The reduction holds for linear problems and smooth convex problems verifying a Slater-type condition. We also derive … Read more

Preconditioned subgradient method for composite optimization: overparameterization and fast convergence

Published: 2025/11/09

Liwei Jiang

Nonlinear Optimization, Nonsmooth Optimization, Optimization in Data Science composite optimization, Levenberg-Morrison-Marquardt, overparameterization, preconditioned methods, subgradient method

Composite optimization problems involve minimizing the composition of a smooth map with a convex function. Such objectives arise in numerous data science and signal processing applications, including phase retrieval, blind deconvolution, and collaborative filtering. The subgradient method achieves local linear convergence when the composite loss is well-conditioned. However, if the smooth map is, in a … Read more

Lyapunov-based Analysis on First Order Method for Composite Strong-Weak Convex Functions

Published: 2025/11/05, Updated: 2025/11/20

Milan Barik

Suvendu Ranjan Pattanaik

Convex Optimization, Nonsmooth Optimization, Unconstrained Optimization convergence rates, lyapunov analysis, Nestrov accelerated gradient method, proximal gradient method, Ravine method, weakly convex function

The Nesterov’s accelerated gradient (NAG) method generalizes the classical gradient descent algorithm by improving the convergence rate from $\mathcal{O}\left(\frac{1}{t}\right)$ to $\mathcal{O}\left(\frac{1}{t^2}\right)$ in convex optimization. This study examines the proximal gradient framework for additively separable composite functions with smooth and non-smooth components. We demonstrate that Nesterov’s accelerated proximal gradient (NAPG$_\alpha$) method attains a convergence rate of … Read more