Shiqian Ma – Page 2 – Optimization Online

AdaBB: Adaptive Barzilai-Borwein Method for Convex Optimization

Published: 2024/01/15

Convex Optimization

Adaptive Gradient Descent, Automated Gradient Descent, Barzilai-Borwein stepsize, Line-Search-Free, Locally Lipschitz Gradient, parameter-free

In this paper, we propose AdaBB, an adaptive gradient method based on the Barzilai-Borwein stepsize. The algorithm is line-search-free and parameter-free, and essentially provides a convergent variant of the Barzilai-Borwein method for general unconstrained convex optimization. We analyze the ergodic convergence of the objective function value and the convergence of the iterates for solving general …
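For context, the classical Barzilai-Borwein (BB) stepsize that the abstract builds on can be sketched as follows. This is a minimal pure-Python illustration of the plain BB rule on a strictly convex quadratic, where the plain method is known to converge; it is not the AdaBB algorithm. The function name `bb_gradient_descent`, the initial step `alpha0`, and the fallback used when the curvature estimate is not positive are assumptions made for this example.

```python
def bb_gradient_descent(grad, x0, alpha0=1e-3, iters=200, tol=1e-10):
    """Gradient descent with the classical (long) BB stepsize.

    The step is alpha_k = (s^T s) / (s^T y), where
    s = x_k - x_{k-1} and y = grad(x_k) - grad(x_{k-1}).
    """
    x_prev = list(x0)
    g_prev = grad(x_prev)
    # First step uses a small fixed stepsize (assumption: no BB history yet).
    x = [xi - alpha0 * gi for xi, gi in zip(x_prev, g_prev)]
    for _ in range(iters):
        g = grad(x)
        if sum(gi * gi for gi in g) ** 0.5 < tol:
            break
        s = [a - b for a, b in zip(x, x_prev)]
        y = [a - b for a, b in zip(g, g_prev)]
        ss = sum(si * si for si in s)
        sy = sum(si * yi for si, yi in zip(s, y))
        # Hypothetical safeguard: fall back to alpha0 if s^T y is not positive.
        alpha = ss / sy if sy > 0.0 else alpha0
        x_prev, g_prev = x, g
        x = [xi - alpha * gi for xi, gi in zip(x, g)]
    return x


def quad_grad(x):
    # Gradient of f(x) = 0.5 * (x0 - 1)^2 + 5 * (x1 + 2)^2, minimized at (1, -2).
    return [x[0] - 1.0, 10.0 * (x[1] + 2.0)]


x_star = bb_gradient_descent(quad_grad, [0.0, 0.0])
```

The stepsize needs only the two most recent iterates and gradients, which is why BB-type methods require no line search. For general convex objectives the plain rule carries no convergence guarantee, which is the gap the abstract says AdaBB addresses.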

A Single-Loop Algorithm for Decentralized Bilevel Optimization

Published: 2023/11/15, Updated: 2024/04/23

Nonlinear Optimization, Optimization in Data Science

bilevel optimization, decentralized optimization

Bilevel optimization has gained significant attention in recent years due to its broad applications in machine learning. This paper focuses on bilevel optimization in decentralized networks and proposes a novel single-loop algorithm for solving decentralized bilevel optimization with a strongly convex lower-level problem. Our approach is a fully single-loop method that approximates the hypergradient using …
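To make the term "hypergradient" concrete, here is a minimal sketch for a toy scalar bilevel problem, using the standard implicit-differentiation formula: the gradient of Phi(x) = f(x, y*(x)) equals grad_x f minus hess_xy g times the inverse of hess_yy g times grad_y f, evaluated at the lower-level solution y*(x). The toy functions, the parameter `a`, and all names are assumptions chosen for illustration. The sketch uses the exact lower-level solution and does not reproduce the paper's approximation scheme, which the listing truncates.

```python
def phi(x, a):
    """Upper-level objective evaluated at the lower-level solution.

    Upper level: f(x, y) = 0.5 * (y - 1)^2 + 0.5 * x^2
    Lower level: g(x, y) = 0.5 * (y - a * x)^2, strongly convex in y,
    so y*(x) = a * x.
    """
    y_star = a * x
    return 0.5 * (y_star - 1.0) ** 2 + 0.5 * x ** 2


def hypergradient(x, a):
    """Exact hypergradient of phi via implicit differentiation."""
    y_star = a * x            # exact lower-level minimizer
    grad_x_f = x              # d f / d x
    grad_y_f = y_star - 1.0   # d f / d y at y*(x)
    hess_xy_g = -a            # d^2 g / (dx dy)
    hess_yy_g = 1.0           # d^2 g / dy^2, positive by strong convexity
    return grad_x_f - hess_xy_g * (1.0 / hess_yy_g) * grad_y_f


# Compare against a central finite difference of phi.
x, a, h = 0.7, 2.0, 1e-6
hg = hypergradient(x, a)
fd = (phi(x + h, a) - phi(x - h, a)) / (2.0 * h)
```

In higher dimensions the scalar division becomes a linear solve with the lower-level Hessian, which is the expensive step that hypergradient approximations aim to avoid.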

Zeroth-order Riemannian Averaging Stochastic Approximation Algorithms

Published: 2023/10/08

Jiaxiang Li

Krishnakumar Balasubramanian

Shiqian Ma

Constrained Nonlinear Optimization, Nonlinear Optimization, Stochastic Programming

We present Zeroth-order Riemannian Averaging Stochastic Approximation (Zo-RASA) algorithms for stochastic optimization on Riemannian manifolds. We show that Zo-RASA achieves optimal sample complexities for generating $\epsilon$-approximate first-order stationary solutions using only one-sample or constant-order batches in each iteration. Our approach employs Riemannian moving-average stochastic gradient estimators and a novel Riemannian-Lyapunov technique for convergence analysis. …

A New Inexact Proximal Linear Algorithm with Adaptive Stopping Criteria for Robust Phase Retrieval

Published: 2023/04/26, Updated: 2024/02/22

Zhong Zheng

Shiqian Ma

Lingzhou Xue

Nonlinear Optimization, Nonsmooth Optimization

complexity, nonconvex and nonsmooth optimization, Proximal Linear Algorithm, Robust Phase Retrieval, Sharpness

This paper considers the robust phase retrieval problem, which can be cast as a nonsmooth and nonconvex optimization problem. We propose a new inexact proximal linear algorithm with the subproblem being solved inexactly. Our contributions are two adaptive stopping criteria for the subproblem. The convergence behavior of the proposed methods is analyzed. Through experiments on …
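As an illustration of why the problem is both nonsmooth and nonconvex, the sketch below evaluates a commonly used robust phase retrieval objective: the average of |(a_i^T x)^2 - b_i| over the measurements. The listing does not state the objective, so this formulation is an assumption, as are the function name and the small data set.

```python
def robust_phase_retrieval_loss(x, A, b):
    """Average absolute residual (1/m) * sum_i |(a_i . x)^2 - b_i|.

    The absolute value makes the loss nonsmooth, and the squared inner
    product inside it makes the loss nonconvex in x.
    """
    total = 0.0
    for a_i, b_i in zip(A, b):
        inner = sum(a_ij * x_j for a_ij, x_j in zip(a_i, x))
        total += abs(inner * inner - b_i)
    return total / len(A)


# Hypothetical measurement vectors and a ground-truth signal.
A = [[1.0, 2.0], [3.0, -1.0], [0.5, 0.5]]
x_true = [1.0, -2.0]
b = [sum(a_ij * x_j for a_ij, x_j in zip(a_i, x_true)) ** 2 for a_i in A]

loss_true = robust_phase_retrieval_loss(x_true, A, b)
loss_flipped = robust_phase_retrieval_loss([-v for v in x_true], A, b)
loss_zero = robust_phase_retrieval_loss([0.0, 0.0], A, b)
```

Both `x_true` and its negation give zero loss, reflecting the sign ambiguity inherent in phase retrieval: the measurements determine the signal only up to a global sign.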

A Riemannian ADMM

Published: 2022/11/04, Updated: 2023/05/16

Jiaxiang Li

Shiqian Ma

Tejes Srivastava

Constrained Nonlinear Optimization, Nonlinear Optimization, Nonsmooth Optimization

We consider a class of Riemannian optimization problems where the objective is the sum of a smooth function and a nonsmooth function, considered in the ambient space. This class of problems finds important applications in machine learning and statistics, such as sparse principal component analysis, sparse spectral clustering, and orthogonal dictionary learning. We propose …

Decentralized Stochastic Bilevel Optimization with Improved Per-Iteration Complexity

Published: 2022/10/23

Xuxing Chen

Minhui Huang

Shiqian Ma

Krishnakumar Balasubramanian

Nonlinear Optimization, Optimization in Data Science, Stochastic Programming

Bilevel optimization has recently received tremendous attention due to its great success in solving important machine learning problems such as meta-learning, reinforcement learning, and hyperparameter optimization. Extending single-agent training on bilevel problems to the decentralized setting is a natural generalization, and there has been a flurry of work studying decentralized bilevel optimization algorithms. However, it …

Federated Learning on Riemannian Manifolds

Published: 2022/06/12

Jiaxiang Li

Shiqian Ma

Nonlinear Optimization

distributed optimization, Federated learning, Riemannian optimization

Federated learning (FL) has found many important applications in smartphone-app-based machine learning. Although many algorithms have been studied for FL, to the best of our knowledge, algorithms for FL with nonconvex constraints have not been studied. This paper studies FL over Riemannian manifolds, which finds important applications such as federated PCA and federated …

Decentralized Bilevel Optimization

Published: 2022/06/12

Xuxing Chen

Minhui Huang

Shiqian Ma

Nonlinear Optimization

bilevel optimization, decentralized optimization, hypergradient estimation

Bilevel optimization has been successfully applied to many important machine learning problems. Algorithms for solving bilevel optimization problems have been studied under various settings. In this paper, we study nonconvex-strongly-convex bilevel optimization in a decentralized setting. We design decentralized algorithms for both deterministic and stochastic bilevel optimization problems. Moreover, we analyze the convergence rates of …

Riemannian Stochastic Proximal Gradient Methods for Nonsmooth Optimization over the Stiefel Manifold

Published: 2022/03/18, Updated: 2022/03/20

Bokun Wang

Shiqian Ma

Lingzhou Xue

Nonlinear Optimization, Statistics

Riemannian optimization has drawn a lot of attention due to its wide applications in practice. Riemannian stochastic first-order algorithms have been studied in the literature to solve large-scale machine learning problems over Riemannian manifolds. However, most of the existing Riemannian stochastic algorithms require the objective function to be differentiable, and they do not apply to …

Efficiently Escaping Saddle Points in Bilevel Optimization

Published: 2022/02/09

Nonlinear Optimization

bilevel optimization, Inexact NEON, minimax problem, Saddle Point

Bilevel optimization is one of the fundamental problems in machine learning and optimization. Recent theoretical developments in bilevel optimization focus on finding first-order stationary points in nonconvex-strongly-convex cases. In this paper, we analyze algorithms that can escape saddle points in nonconvex-strongly-convex bilevel optimization. Specifically, we show that the perturbed approximate implicit differentiation (AID) with …