A Generalization Result for Convergence in Learning-to-Optimize

Published: 2024/10/10

Convex and Nonsmooth Optimization, Nonsmooth Optimization, Optimization in Data Science learning-to-optimize, pac-bayes, stationary points Short URL: https://optimization-online.org/?p=27980

Convergence in learning-to-optimize is hardly studied, because conventional convergence guarantees in optimization are based on geometric arguments, which cannot be applied easily to learned algorithms. Thus, we develop a probabilistic framework that resembles deterministic optimization and allows for transferring geometric arguments into learning-to-optimize. Our main theorem is a generalization result for parametric classes of potentially non-smooth, non-convex loss functions and establishes the convergence of learned optimization algorithms to stationary points with high probability. This can be seen as a statistical counterpart to the use of geometric safeguards to ensure convergence. To the best of our knowledge, we are the first to prove convergence of optimization algorithms in such a probabilistic framework.

Article

Download

View PDF