Raghu Bollapragada – Page 2 – Optimization Online

Adaptive Sampling Quasi-Newton Methods for Derivative-Free Stochastic Optimization

Published: 2019/10/31

We consider stochastic zero-order optimization problems, which arise in settings from simulation optimization to reinforcement learning. We propose an adaptive sampling quasi-Newton method where we estimate the gradients of a stochastic function using finite differences within a common random number framework. We employ modified versions of a norm test and an inner product quasi-Newton test … Read more

A Progressive Batching L-BFGS Method for Machine Learning

Published: 2018/02/14

Convex Optimization, Nonlinear Optimization, Stochastic Programming deep learning, nonconvex optimization, sample selection, stochastic optimization

The standard L-BFGS method relies on gradient approximations that are not dominated by noise, so that search directions are descent directions, the line search is reliable, and quasi-Newton updating yields useful quadratic models of the objective function. All of this appears to call for a full batch approach, but since small batch sizes give rise … Read more

Adaptive Sampling Strategies for Stochastic Optimization

Published: 2017/10/30

Raghu Bollapragada

Richard H. Byrd

Jorge Nocedal

Convex Optimization, Nonlinear Optimization, Stochastic Programming adaptive sampling, machine learning, stochastic optimization

In this paper, we propose a stochastic optimization method that adaptively controls the sample size used in the computation of gradient approximations. Unlike other variance reduction techniques that either require additional storage or the regular computation of full gradients, the proposed method reduces variance by increasing the sample size as needed. The decision to increase … Read more

Balancing Communication and Computation in Distributed Optimization

Published: 2017/10/02, Updated: 2018/05/31

Albert S. Berahas

Raghu Bollapragada

Nitish Shirish Keskar

Ermin Wei

Convex Optimization, Distributed Control, Network Optimization communication, computation, consensus, cost framework, distributed optimization

Methods for distributed optimization have received significant attention in recent years owing to their wide applicability in various domains including machine learning, robotics and sensor networks. A distributed optimization method typically consists of two key components: communication and computation. More specifically, at every iteration (or every several iterations) of a distributed algorithm, each node in … Read more

An Investigation of Newton-Sketch and Subsampled Newton Methods

Published: 2017/05/17, Updated: 2019/05/30

Albert S. Berahas

Raghu Bollapragada

Jorge Nocedal

Convex Optimization, Nonlinear Optimization, Stochastic Programming machine learning, newton methods, sketching, subsampling

Sketching, a dimensionality reduction technique, has received much attention in the statistics community. In this paper, we study sketching in the context of Newton’s method for solving finite-sum optimization problems in which the number of variables and data points are both large. We study two forms of sketching that perform dimensionality reduction in data space: … Read more

Exact and Inexact Subsampled Newton Methods for Optimization

Published: 2016/09/27

Raghu Bollapragada

Richard H. Byrd

Jorge Nocedal

Convex Optimization, Nonlinear Optimization, Stochastic Programming machine learning, subsampling

The paper studies the solution of stochastic optimization problems in which approximations to the gradient and Hessian are obtained through subsampling. We first consider Newton-like methods that employ these approximations and discuss how to coordinate the accuracy in the gradient and Hessian to yield a superlinear rate of convergence in expectation. The second part of … Read more