Identifiability, the KL property in metric spaces, and subgradient curves

Identifiability, and the closely related idea of partial smoothness, unify classical active set methods and more general notions of solution structure. Diverse optimization algorithms generate iterates in discrete time that are eventually confined to identifiable sets. We present two fresh perspectives on identifiability. The first distills the notion to a simple metric property, applicable not …
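The phenomenon of iterates becoming confined to an identifiable set can be seen in a standard example: proximal gradient descent (ISTA) on a lasso problem, where after finitely many steps the iterates' support (active set) freezes even though the iterates themselves keep moving. The sketch below uses hypothetical toy data, not anything from the papers above.

```python
import numpy as np

def soft(v, t):
    # soft-thresholding: proximal operator of t * ||.||_1
    return np.sign(v) * np.maximum(np.abs(v) - t, 0.0)

# toy lasso: min 0.5*||A x - b||^2 + lam*||x||_1  (hypothetical data)
A = np.array([[1.0, 0.3],
              [0.3, 1.0]])
b = np.array([1.0, 0.1])
lam = 0.5
L = np.linalg.norm(A.T @ A, 2)   # Lipschitz constant of the smooth part's gradient

x = np.zeros(2)
supports = []
for _ in range(200):
    # one proximal gradient (ISTA) step
    x = soft(x - (A.T @ (A @ x - b)) / L, lam / L)
    supports.append(tuple(np.abs(x) > 1e-12))

# the support stabilizes long before x converges: the active set is "identified"
print("final iterate:", x)
print("final support:", supports[-1])
```

Here the set of coordinates allowed to be nonzero is the identifiable set; once the iterates enter it, they never leave, which is exactly the discrete-time confinement the abstract refers to.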

The structure of conservative gradient fields

The classical Clarke subdifferential alone is inadequate for understanding automatic differentiation in nonsmooth contexts. Instead, we can sometimes rely on enlarged generalized gradients called “conservative fields”, defined through the natural path-wise chain rule; one application is the convergence analysis of gradient-based deep learning algorithms. In the semi-algebraic case, we show that all conservative fields are …
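A standard illustration of why the Clarke subdifferential alone does not capture what automatic differentiation computes: write the identity function as f(x) = relu(x) - relu(-x). Applying the chain rule branch-wise with the usual convention relu'(0) = 0, as autodiff does, yields derivative 0 at the origin, which lies outside the Clarke subdifferential {1} of the identity. Conservative fields are built to accommodate such extra outputs. This is a self-contained sketch of that example, not code from the paper.

```python
def relu(x):
    return max(x, 0.0)

def drelu(x):
    # the usual autodiff convention: relu'(0) = 0
    return 1.0 if x > 0 else 0.0

def f(x):
    # f(x) == x for every x, so the true derivative is 1 everywhere
    return relu(x) - relu(-x)

def ad_df(x):
    # branch-wise chain rule, mimicking what autodiff computes for f
    return drelu(x) - drelu(-x) * (-1.0)

print(f(0.0), ad_df(0.0))   # value is correct, derivative at 0 is not
print(ad_df(1.0), ad_df(-1.0))
```

Away from 0 the branch-wise chain rule returns the correct derivative 1; only at the nonsmooth point does it produce the spurious value 0, which a conservative field for f must contain.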

A New Perspective on Boosting in Linear Regression via Subgradient Optimization and Relatives

In this paper we analyze boosting algorithms in linear regression from a new perspective: that of modern first-order methods in convex optimization. We show that classic boosting algorithms in linear regression, namely the incremental forward stagewise algorithm (FS-epsilon) and least squares boosting (LS-Boost-epsilon), can be viewed as subgradient descent to minimize the loss function defined …
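The incremental forward stagewise algorithm itself is short: at each step, find the feature most correlated with the current residual and move its coefficient by a tiny amount epsilon in the sign of that correlation. The sketch below runs it on hypothetical synthetic data; it illustrates the update rule only, not the paper's subgradient-descent analysis.

```python
import numpy as np

rng = np.random.default_rng(0)
# hypothetical data: sparse ground truth plus a little noise
X = rng.standard_normal((50, 5))
y = X @ np.array([2.0, 0.0, -1.0, 0.0, 0.0]) + 0.1 * rng.standard_normal(50)

eps = 0.01
beta = np.zeros(5)
for _ in range(2000):
    r = y - X @ beta                  # current residual
    c = X.T @ r                       # correlations of features with residual
    j = int(np.argmax(np.abs(c)))     # most correlated feature
    beta[j] += eps * np.sign(c[j])    # tiny step, FS-epsilon style

print("coefficients:", np.round(beta, 2))
print("residual norm:", np.linalg.norm(y - X @ beta))
```

Each update moves along a single coordinate by a fixed small amount, which is what invites the subgradient-descent reading: the step direction is a (scaled) element of the subdifferential of the maximum absolute correlation.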