Shiqian Ma – Optimization Online

Problem-Parameter-Free Decentralized Nonconvex Stochastic Optimization

Published: 2024/02/15

Nonlinear Optimization, Stochastic Programming

Existing decentralized algorithms usually require knowledge of problem parameters for updating local iterates. For example, the hyperparameters (such as learning rate) usually require the knowledge of Lipschitz constant of the global gradient or topological information of the communication networks, which are usually not accessible in practice. In this paper, we propose D-NASA, the first algorithm … Read more

Riemannian Bilevel Optimization

Published: 2024/02/06, Updated: 2024/02/07

Jiaxiang Li

Shiqian Ma

In this work, we consider the bilevel optimization problem on Riemannian manifolds. We inspect the calculation of the hypergradient of such problems on general manifolds and thus enable the utilization of gradient-based algorithms to solve such problems. The calculation of the hypergradient requires utilizing the notion of Riemannian cross-derivative and we inspect the properties and … Read more

AdaBB: Adaptive Barzilai-Borwein Method for Convex Optimization

Published: 2024/01/15

Danqing Zhou

Shiqian Ma

Junfeng Yang

Convex Optimization Adaptive Gradient Descent, Automated Gradient Descent, barzilai-borwein stepsize, Line-Search-Free, Locally Lipschitz Gradient, parameter-free

In this paper, we propose AdaBB, an adaptive gradient method based on the Barzilai-Borwein stepsize. The algorithm is line-search-free and parameter-free, and essentially provides a convergent variant of the Barzilai-Borwein method for general unconstrained convex optimization. We analyze the ergodic convergence of the objective function value and the convergence of the iterates for solving general … Read more

A Single-Loop Algorithm for Decentralized Bilevel Optimization

Published: 2023/11/15, Updated: 2024/04/23

Nonlinear Optimization, Optimization in Data Science bilevel optimization, decentralized optimization

Bilevel optimization has gained significant attention in recent years due to its broad applications in machine learning. This paper focuses on bilevel optimization in decentralized networks and proposes a novel single-loop algorithm for solving decentralized bilevel optimization with a strongly convex lower-level problem. Our approach is a fully single-loop method that approximates the hypergradient using … Read more

Zeroth-order Riemannian Averaging Stochastic Approximation Algorithms

Published: 2023/10/08

Jiaxiang Li

Krishnakumar Balasubramanian

Shiqian Ma

Constrained Nonlinear Optimization, Nonlinear Optimization, Stochastic Programming

We present Zeroth-order Riemannian Averaging Stochastic Approximation (\texttt{Zo-RASA}) algorithms for stochastic optimization on Riemannian manifolds. We show that \texttt{Zo-RASA} achieves optimal sample complexities for generating $\epsilon$-approximation first-order stationary solutions using only one-sample or constant-order batches in each iteration. Our approach employs Riemannian moving-average stochastic gradient estimators, and a novel Riemannian-Lyapunov analysis technique for convergence analysis. … Read more

A New Inexact Proximal Linear Algorithm with Adaptive Stopping Criteria for Robust Phase Retrieval

Published: 2023/04/26, Updated: 2024/02/22

Zhong Zheng

Shiqian Ma

Lingzhou Xue

Nonlinear Optimization, Nonsmooth Optimization complexity, nonconvex and nonsmooth optimization, Proximal Linear Algorithm, Robust Phase Retrieval, Sharpness.

This paper considers the robust phase retrieval problem, which can be cast as a nonsmooth and nonconvex optimization problem. We propose a new inexact proximal linear algorithm with the subproblem being solved inexactly. Our contributions are two adaptive stopping criteria for the subproblem. The convergence behavior of the proposed methods is analyzed. Through experiments on … Read more

A Riemannian ADMM

Published: 2022/11/04, Updated: 2023/05/16

Jiaxiang Li

Shiqian Ma

Tejes Srivastava

Constrained Nonlinear Optimization, Nonlinear Optimization, Nonsmooth Optimization

We consider a class of Riemannian optimization problems where the objective is the sum of a smooth function and a nonsmooth function, considered in the ambient space. This class of problems finds important applications in machine learning and statistics such as the sparse principal component analysis, sparse spectral clustering, and orthogonal dictionary learning. We propose … Read more

Decentralized Stochastic Bilevel Optimization with Improved Per-Iteration Complexity

Published: 2022/10/23

Xuxing Chen

Minhui Huang

Shiqian Ma

Krishnakumar Balasubramanian

Nonlinear Optimization, Optimization in Data Science, Stochastic Programming

Bilevel optimization recently has received tremendous attention due to its great success in solving important machine learning problems like meta learning, reinforcement learning, and hyperparameter optimization. Extending single-agent training on bilevel problems to the decentralized setting is a natural generalization, and there has been a flurry of work studying decentralized bilevel optimization algorithms. However, it … Read more

Federated Learning on Riemannian Manifolds

Published: 2022/06/12

Jiaxiang Li

Shiqian Ma

Nonlinear Optimization distributed optimization, Federated learning, riemannian optimization

Federated learning (FL) has found many important applications in smart-phone-APP based machine learning applications. Although many algorithms have been studied for FL, to the best of our knowledge, algorithms for FL with nonconvex constraints have not been studied. This paper studies FL over Riemannian manifolds, which finds important applications such as federated PCA and federated … Read more

Decentralized Bilevel Optimization

Published: 2022/06/12

Xuxing Chen

Minhui Huang

Shiqian Ma

Nonlinear Optimization bilevel optimization, decentralized optimization, hypergradient estimation

Bilevel optimization has been successfully applied to many important machine learning problems. Algorithms for solving bilevel optimization have been studied under various settings. In this paper, we study the nonconvex-strongly-convex bilevel optimization under a decentralized setting. We design decentralized algorithms for both deterministic and stochastic bilevel optimization problems. Moreover, we analyze the convergence rates of … Read more