Albert S. Berahas – Page 2 – Optimization Online

Quasi-Newton Methods for Deep Learning: Forget the Past, Just Sample

Published: 2019/02/02, Updated: 2019/05/28

Categories: Nonlinear Optimization, Unconstrained Optimization; Keywords: curvature pairs, deep learning, quasi-newton methods, sampling

We present two sampled quasi-Newton methods: sampled LBFGS and sampled LSR1. Contrary to the classical variants of these methods that sequentially build (inverse) Hessian approximations as the optimization progresses, our proposed methods sample points randomly around the current iterate to produce these approximations. As a result, the approximations constructed make use of more reliable (recent …
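The sampling idea in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `sample_curvature_pairs`, the Gaussian displacements, the `radius` parameter, and the choice to form each pair from a gradient difference are all assumptions made for illustration. The sketch draws random points around the current iterate `w` and records pairs `(s, y)` of displacement and gradient change, which is the kind of information a quasi-Newton update consumes.

```python
import random


def sample_curvature_pairs(grad, w, num_pairs=5, radius=0.1, seed=0):
    """Build (s, y) pairs from random points around the iterate w.

    Assumption: s is a scaled Gaussian displacement and y is the
    difference of gradients at w + s and at w.
    """
    rng = random.Random(seed)
    g_w = grad(w)
    pairs = []
    for _ in range(num_pairs):
        # Random displacement around the current iterate.
        s = [radius * rng.gauss(0.0, 1.0) for _ in w]
        w_bar = [wi + si for wi, si in zip(w, s)]
        # Gradient change along the sampled displacement.
        y = [gb - g for gb, g in zip(grad(w_bar), g_w)]
        pairs.append((s, y))
    return pairs


def quad_grad(w):
    # Gradient of the toy objective f(w) = w0^2 + 1.5 * w1^2,
    # whose Hessian is diag(2, 3).
    return [2.0 * w[0], 3.0 * w[1]]


pairs = sample_curvature_pairs(quad_grad, [1.0, -1.0])
```

On the toy quadratic every sampled pair satisfies `y = H s` with `H = diag(2, 3)`, so the pairs carry curvature information taken at the current iterate rather than at earlier iterates.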

Derivative-Free Optimization of Noisy Functions via Quasi-Newton Methods

Published: 2018/03/27, Updated: 2019/01/08

Albert S. Berahas

Richard H. Byrd

Jorge Nocedal

Categories: Nonlinear Optimization, Unconstrained Optimization; Keywords: derivative-free optimization, nonlinear optimization, stochastic optimization

This paper presents a finite difference quasi-Newton method for the minimization of noisy functions. The method takes advantage of the scalability and power of BFGS updating, and employs an adaptive procedure for choosing the differencing interval h based on the noise estimation techniques of Hamming (2012) and Moré and Wild (2011). This noise estimation procedure …
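To make the role of the differencing interval h concrete, here is a minimal sketch of a forward-difference gradient whose step depends on an estimate of the noise in the function values. It is an illustration under stated assumptions, not the paper's procedure: the noise level is passed in rather than estimated, the name `forward_diff_gradient` and the parameter `curvature_bound` are hypothetical, and the rule h = 8^(1/4) * sqrt(noise_level / curvature_bound) is assumed as one common noise-aware choice for forward differences.

```python
import math


def forward_diff_gradient(f, x, noise_level, curvature_bound=1.0):
    """Forward-difference gradient with a noise-aware interval h.

    Assumptions: noise_level estimates the error in evaluations of f,
    and curvature_bound bounds the second derivative along each
    coordinate direction.
    """
    # Larger noise calls for a larger h; larger curvature for a smaller h.
    h = 8 ** 0.25 * math.sqrt(noise_level / curvature_bound)
    f_x = f(x)
    grad = []
    for i in range(len(x)):
        x_step = list(x)
        x_step[i] += h
        grad.append((f(x_step) - f_x) / h)
    return grad, h


def toy_objective(x):
    # Smooth test function with exact gradient (2 * x0, 3).
    return x[0] ** 2 + 3.0 * x[1]


grad, h = forward_diff_gradient(
    toy_objective, [1.0, 2.0], noise_level=1e-8, curvature_bound=2.0
)
```

The trade-off behind such a rule is that a very small h amplifies the noise in the difference `f(x + h e_i) - f(x)`, while a large h increases the truncation error of the forward-difference formula.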

Balancing Communication and Computation in Distributed Optimization

Published: 2017/10/02, Updated: 2018/05/31

Albert S. Berahas

Raghu Bollapragada

Nitish Shirish Keskar

Ermin Wei

Categories: Convex Optimization, Distributed Control, Network Optimization; Keywords: communication, computation, consensus, cost framework, distributed optimization

Methods for distributed optimization have received significant attention in recent years owing to their wide applicability in various domains including machine learning, robotics and sensor networks. A distributed optimization method typically consists of two key components: communication and computation. More specifically, at every iteration (or every several iterations) of a distributed algorithm, each node in …

A Robust Multi-Batch L-BFGS Method for Machine Learning

Published: 2017/07/25, Updated: 2019/03/31

Albert S. Berahas

Martin Takac

Categories: Convex Optimization, Unconstrained Optimization; Keywords: consistency, fault-tolerant, l-bfgs, machine learning, multi-batch, sample selection

This paper describes an implementation of the L-BFGS method designed to deal with two adversarial situations. The first occurs in distributed computing environments where some of the computational nodes devoted to the evaluation of the function and gradient are unable to return results on time. A similar challenge occurs in a multi-batch approach in which …

An Investigation of Newton-Sketch and Subsampled Newton Methods

Published: 2017/05/17, Updated: 2019/05/30

Albert S. Berahas

Raghu Bollapragada

Jorge Nocedal

Categories: Convex Optimization, Nonlinear Optimization, Stochastic Programming; Keywords: machine learning, newton methods, sketching, subsampling

Sketching, a dimensionality reduction technique, has received much attention in the statistics community. In this paper, we study sketching in the context of Newton’s method for solving finite-sum optimization problems in which the number of variables and data points are both large. We study two forms of sketching that perform dimensionality reduction in data space: …