Expected complexity analysis of stochastic direct-search

This work presents a convergence rate analysis of stochastic variants of the broad class of direct-search methods of directional type. It introduces an algorithm designed to optimize differentiable objective functions $f$ whose values can only be computed through a stochastically noisy blackbox. The proposed stochastic directional direct-search (SDDS) algorithm accepts new iterates by imposing a …
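Since the abstract is truncated before the acceptance condition, what follows is only a minimal sketch of a directional direct-search loop that accepts iterates via sufficient decrease on averaged function estimates; the threshold `gamma * alpha**2`, the replication count `n_samples`, and the expansion/shrinkage factors are illustrative assumptions, not the paper's exact conditions.

```python
import numpy as np

def sdds_sketch(noisy_f, x0, alpha0=1.0, gamma=1e-3, n_samples=5, max_iter=200):
    """Illustrative stochastic directional direct-search loop (not the exact SDDS)."""
    x, alpha = np.asarray(x0, dtype=float), alpha0
    est = lambda y: np.mean([noisy_f(y) for _ in range(n_samples)])  # averaged estimate
    fx = est(x)
    for _ in range(max_iter):
        d = np.random.randn(x.size)
        d /= np.linalg.norm(d)                        # random poll direction
        improved = False
        for trial in (x + alpha * d, x - alpha * d):
            ft = est(trial)
            if ft <= fx - gamma * alpha**2:           # sufficient decrease on estimates
                x, fx, improved = trial, ft, True
                break
        alpha = 2.0 * alpha if improved else 0.5 * alpha  # expand or shrink the step
    return x
```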

Zero Order Stochastic Weakly Convex Composite Optimization

In this paper we consider stochastic weakly convex composite problems in which no stochastic subgradient oracle is available. We present a derivative-free algorithm that uses a two-point approximation to compute a gradient estimate of the smoothed function. We prove convergence at a rate similar to that of state-of-the-art methods, however with …
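For concreteness, the standard two-point estimator of the gradient of a Gaussian-smoothed function looks as follows; the smoothing radius `mu` and the Gaussian directions are conventional choices, and the paper's exact estimator and step-size rules may differ.

```python
import numpy as np

def two_point_grad(noisy_f, x, mu=1e-2):
    """Two-point estimate of the gradient of the smoothed function
    f_mu(x) = E_u[f(x + mu*u)] with u ~ N(0, I)."""
    u = np.random.randn(x.size)
    return (noisy_f(x + mu * u) - noisy_f(x - mu * u)) / (2.0 * mu) * u
```

Plugging such an estimate into a proximal or subgradient-type step then yields a fully zero-order method for the composite problem.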

A Fully Stochastic Second-Order Trust Region Method

A stochastic second-order trust region method is proposed, which can be viewed as a second-order extension of the trust-region-ish (TRish) algorithm proposed by Curtis et al. [INFORMS J. Optim. 1(3) 200–220, 2019]. In each iteration, a search direction is computed by (approximately) solving a trust region subproblem defined by stochastic gradient and Hessian estimates. The …
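As a reference point, one common way to approximately solve the trust-region subproblem $\min_s g^\top s + \tfrac12 s^\top H s$ subject to $\|s\| \le \delta$ is the Cauchy point computed below; this is a generic solver sketch with stochastic estimates $g$ and $H$ supplied by the caller, not the specific step-normalization rules of TRish or its second-order extension.

```python
import numpy as np

def cauchy_point(g, H, delta):
    """Cauchy point: minimize the quadratic model along -g within ||s|| <= delta."""
    gn = np.linalg.norm(g)
    if gn == 0.0:
        return np.zeros_like(g)
    gHg = g @ H @ g
    tau = 1.0 if gHg <= 0 else min(1.0, gn**3 / (delta * gHg))  # boundary step if curvature <= 0
    return -(tau * delta / gn) * g
```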

Adaptive Sampling Quasi-Newton Methods for Derivative-Free Stochastic Optimization

We consider stochastic zero-order optimization problems, which arise in settings from simulation optimization to reinforcement learning. We propose an adaptive sampling quasi-Newton method in which we estimate the gradients of a stochastic function using finite differences within a common random number framework. We employ modified versions of a norm test and an inner product quasi-Newton test …
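A minimal sketch of the common-random-number finite-difference idea: both evaluations of each central difference are run with the same random seed, so the noise largely cancels. The signature `f(x, rng)`, the fixed difference step `h`, and the fixed seed list are assumptions for illustration; the paper's adaptive tests that grow the sample size are omitted.

```python
import numpy as np

def crn_fd_grad(f, x, seeds, h=1e-4):
    """Central finite-difference gradient, averaged over random seeds,
    reusing each seed for both evaluations (common random numbers)."""
    n, g = x.size, np.zeros(x.size)
    for s in seeds:
        for i in range(n):
            e = np.zeros(n)
            e[i] = h
            rng_p, rng_m = np.random.default_rng(s), np.random.default_rng(s)
            g[i] += (f(x + e, rng_p) - f(x - e, rng_m)) / (2.0 * h)
    return g / len(seeds)
```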

Admissibility of solution estimators for stochastic optimization

We look at stochastic optimization problems through the lens of statistical decision theory. In particular, we address the admissibility, in the decision-theoretic sense, of the natural sample average estimator for a stochastic optimization problem (also known as the empirical risk minimization (ERM) rule in the learning literature). It is well known that for …
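For reference, the estimator under study is the sample average (ERM) rule; the notation below is a standard formulation, not necessarily the paper's:

```latex
\[
\hat{x}_n \in \operatorname*{arg\,min}_{x \in X} \frac{1}{n} \sum_{i=1}^{n} F(x, \xi_i)
\quad \text{as an estimator of} \quad
x^{*} \in \operatorname*{arg\,min}_{x \in X} \mathbb{E}_{\xi}\bigl[F(x, \xi)\bigr],
\]
```

where $\xi_1, \dots, \xi_n$ are i.i.d. samples; admissibility asks whether any competing estimator achieves risk no worse under every underlying distribution and strictly better under some.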

Stochastic mesh adaptive direct search for blackbox optimization using probabilistic estimates

We present a stochastic extension of the mesh adaptive direct search (MADS) algorithm originally developed for deterministic blackbox optimization. The algorithm, called StoMADS, considers the unconstrained optimization of an objective function $f$ whose values can be computed only through a blackbox corrupted by some random noise following an unknown distribution. The proposed method is based …
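Since the abstract cuts off before the method's details, here is only a rough skeleton of a mesh-based poll step with averaged function estimates; the positive spanning poll set, the replication count, and the frame-size updates are illustrative stand-ins for StoMADS's actual probabilistic-estimate machinery.

```python
import numpy as np

def mesh_poll_sketch(noisy_f, x0, delta0=1.0, gamma=1e-3, n_samples=10, max_iter=100):
    """Illustrative mesh-based poll with averaged estimates (not the exact StoMADS)."""
    x, delta = np.asarray(x0, dtype=float), delta0
    est = lambda y: np.mean([noisy_f(y) for _ in range(n_samples)])
    fx = est(x)
    for _ in range(max_iter):
        polls = [x + delta * d for e in np.eye(x.size) for d in (e, -e)]  # +/- e_i frame
        vals = [est(p) for p in polls]
        k = int(np.argmin(vals))
        if vals[k] <= fx - gamma * delta**2:              # sufficient decrease on estimates
            x, fx, delta = polls[k], vals[k], 2.0 * delta  # success: expand the frame
        else:
            delta *= 0.5                                   # failure: refine the mesh
    return x
```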

Stochastic generalized gradient methods for training nonconvex nonsmooth neural networks

The paper observes a similarity between the stochastic optimal control of discrete dynamical systems and the training of multilayer neural networks. It focuses on contemporary deep networks with nonconvex nonsmooth loss and activation functions. The machine learning problems are treated as nonconvex nonsmooth stochastic optimization problems. As a model of nonsmooth nonconvex dependencies, the so-called generalized …
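The workhorse of this line of work is the stochastic generalized-gradient iteration; a minimal sketch with a divergent-series step rule follows, where `subgrad(x)` stands for any stochastic generalized gradient supplied by the user (an assumed interface, not the paper's notation).

```python
import numpy as np

def generalized_gradient_descent(subgrad, x0, steps=1000, a=1.0):
    """x_{k+1} = x_k - rho_k * g_k with stochastic generalized gradients g_k
    and step sizes rho_k -> 0 such that sum_k rho_k = infinity."""
    x = np.asarray(x0, dtype=float)
    for k in range(steps):
        x = x - (a / (k + 1)) * subgrad(x)  # rho_k = a / (k + 1)
    return x
```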

Substantiation of the Backpropagation Technique via the Hamilton-Pontryagin Formalism for Training Nonconvex Nonsmooth Neural Networks

The paper observes the similarity between the stochastic optimal control of discrete dynamical systems and the training of multilayer neural networks. It focuses on contemporary deep networks with nonconvex nonsmooth loss and activation functions. The machine learning problems are treated as nonconvex nonsmooth stochastic optimization problems. As a model of nonsmooth nonconvex dependencies, …
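The correspondence the paper builds on can be summarized by the discrete-time Hamilton-Pontryagin (adjoint) equations, in which the backward adjoint recursion is exactly backpropagation; the notation below is a standard smooth-case rendering, not the paper's nonsmooth generalization.

```latex
\[
x_{t+1} = f_t(x_t, u_t), \qquad J = \Phi(x_T), \qquad
H_t = \psi_{t+1}^{\top} f_t(x_t, u_t),
\]
\[
\psi_T = \nabla_x \Phi(x_T), \qquad
\psi_t = \Bigl(\tfrac{\partial f_t}{\partial x_t}\Bigr)^{\!\top} \psi_{t+1}, \qquad
\nabla_{u_t} J = \Bigl(\tfrac{\partial f_t}{\partial u_t}\Bigr)^{\!\top} \psi_{t+1}.
\]
```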

Generalized Gradients in Problems of Dynamic Optimization, Optimal Control, and Machine Learning

In this work, nonconvex nonsmooth problems of dynamic optimization, optimal control in discrete time (including feedback control), and machine learning are considered from a common point of view. An analogy is observed between the tasks of controlling discrete dynamical systems and training multilayer neural networks with nonsmooth objective functions and connections. Methods for calculating generalized gradients …
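For reference, the generalized gradient in question is, in the locally Lipschitz case, the Clarke subdifferential (a standard definition, not specific to this paper):

```latex
\[
\partial f(x) \;=\; \operatorname{conv}\Bigl\{\, g : g = \lim_{k \to \infty} \nabla f(x_k),
\; x_k \to x, \; x_k \in D \,\Bigr\},
\]
```

where $D$ is the full-measure set on which the locally Lipschitz function $f$ is differentiable (Rademacher's theorem).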

A Data Efficient and Feasible Level Set Method for Stochastic Convex Optimization with Expectation Constraints

Stochastic convex optimization problems with expectation constraints (SOECs) are encountered in statistics and machine learning, business, and engineering. In data-rich environments, the SOEC objective and constraints contain expectations defined with respect to large datasets. Therefore, efficient algorithms for solving such SOECs need to limit the fraction of data points that they use, which we refer …
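A level set method of this kind typically reduces the constrained problem to one-dimensional root finding on the level function $H(r) = \min_x \max\{f(x) - r,\; g(x)\}$, which is nonincreasing in $r$ and crosses zero at the optimal value; the bisection skeleton below is a deterministic sketch under that assumption, omitting the paper's stochastic estimates and data-efficiency mechanisms.

```python
def level_set_bisection(H, r_lo, r_hi, tol=1e-6):
    """Root-find H(r) = 0 for a nonincreasing level function H,
    given brackets with H(r_lo) > 0 >= H(r_hi)."""
    while r_hi - r_lo > tol:
        r = 0.5 * (r_lo + r_hi)
        if H(r) <= 0:
            r_hi = r        # level r is achievable: tighten from above
        else:
            r_lo = r        # level r is not achievable: raise the target
    return r_hi
```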