deep learning – Optimization Online

Deep learning and hyperparameter optimization for assessing one’s eligibility for a subcutaneous implantable cardioverter-defibrillator

Published: 2022/11/21

Applications - OR and Management Sciences, Applications - Science and Engineering, Optimization in Data Science deep learning, machine learning, optimization, subcutaneous implantable cardioverter defibrillators

In cardiology, it is standard for patients suffering from ventricular arrhythmias (the leading cause of sudden cardiac death) belonging to high risk populations to be treated using Subcutaneous Implantable Cardioverter-Defibrillators (S-ICDs). S-ICDs carry a risk of so-called T Wave Over Sensing (TWOS), which can lead to inappropriate shocks with an inherent health risk. For this … Read more

Neur2SP: Neural Two-stage Stochastic Programming

Published: 2022/05/25

Stochastic Programming deep learning, supervised learning, two-stage stochastic programming

Stochastic programming is a powerful modeling framework for decision-making under uncertainty. In this work, we tackle two-stage stochastic programs (2SPs), the most widely applied and studied class of stochastic programming models. Solving 2SPs exactly requires evaluation of an expected value function that is computationally intractable. Additionally, having a mixed-integer linear program (MIP) or a nonlinear … Read more

A minibatch stochastic Quasi-Newton method adapted for nonconvex deep learning problems

Published: 2022/01/07, Updated: 2022/01/13

Joshua D. Griffin

Majid Jahani

Martin Takac

Seyedalireza Yektamaram

Wenwen Zhou

Nonlinear Optimization, Unconstrained Optimization deep learning, hessian-free methods, quasi-newton methods

In this study, we develop a limited memory nonconvex Quasi-Newton (QN) method, tailored to deep learning (DL) applications. Since the stochastic nature of (sampled) function information in minibatch processing can affect the performance of QN methods, three strategies are utilized to overcome this issue. These involve a novel progressive trust-region radius update (suitable for stochastic … Read more

Training Structured Neural Networks Through Manifold Identification and Variance Reduction

Published: 2021/12/04, Updated: 2022/03/15

Zih-Syuan Huang

Ching-pei Lee

Nonlinear Optimization, Nonsmooth Optimization deep learning, dual averaging, manifold identification, regularized optimization, variance reduction

This paper proposes an algorithm, RMDA, for training neural networks (NNs) with a regularization term for promoting desired structures. RMDA does not incur computation additional to proximal SGD with momentum, and achieves variance reduction without requiring the objective function to be of the finite-sum form. Through the tool of manifold identification from nonlinear optimization, we … Read more

The structure of conservative gradient fields

Published: 2021/01/03

Adrian Lewis

Tonghua Tian

Nonsmooth Optimization automatic differentiation, clarke subdifferential, conservative field, deep learning, semi-algebraic, stratification, subgradient descent, variational analysis

The classical Clarke subdifferential alone is inadequate for understanding automatic differentiation in nonsmooth contexts. Instead, we can sometimes rely on enlarged generalized gradients called “conservative fields”, defined through the natural path-wise chain rule: one application is the convergence analysis of gradient-based deep learning algorithms. In the semi-algebraic case, we show that all conservative fields are … Read more

Stochastic Multi-level Composition Optimization Algorithms with Level-Independent Convergence Rates

Published: 2020/09/07

Krishnakumar Balasubramanian

Saeed Ghadimi

Anthony Nguyen

Nonlinear Optimization, Statistics, Stochastic Programming deep learning, nonconvex optimization, stochastic approximation, stochsatic composition optimization

In this paper, we study smooth stochastic multi-level composition optimization problems, where the objective function is a nested composition of $T$ functions. We assume access to noisy evaluations of the functions and their gradients, through a stochastic first-order oracle. For solving this class of problems, we propose two algorithms using moving-average stochastic estimates, and analyze … Read more

On the Impact of Deep Learning-based Time-series Forecasts on Multistage Stochastic Programming Policies

Published: 2020/08/31, Updated: 2021/09/19

Merve Bodur

Mucahit Cevik

Juyoung Wang

Stochastic Programming autoregressive process, deep learning, multistage stochastic programming, policy evaluation, time series forecasting

Multistage stochastic programming provides a modeling framework for sequential decision-making problems that involve uncertainty. One typically overlooked aspect of this methodology is how uncertainty is incorporated into modeling. Traditionally, statistical forecasting techniques with simple forms, e.g., (first-order) autoregressive time-series models, are used to extract scenarios to be added to optimization models to represent the uncertain … Read more

An Integer Programming Approach to Deep Neural Networks with Binary Activation Functions

Published: 2020/07/06, Updated: 2022/05/13

Bubacarr Bah

Jannis Kurtz

(Mixed) Integer Linear Programming, Other Topics, Robust Optimization adversarial attacks, binary neural network, deep learning, integer programming

We study deep neural networks with binary activation functions (BDNN), i.e. the activation function only has two states. We show that the BDNN can be reformulated as a mixed-integer linear program which can be solved to global optimality by classical integer programming solvers. Additionally, a heuristic solution algorithm is presented and we study the model … Read more

Stochastic generalized gradient methods for training nonconvex nonsmooth neural networks

Published: 2019/09/29

Vladimir I. Norkin

Nonlinear Optimization, Nonsmooth Optimization, Stochastic Programming deep learning, machine learning, multilayer neural networks, nonsmooth nonconvex optimization, stochastic generalized gradient, stochastic optimization

The paper observes a similarity between the stochastic optimal control of discrete dynamical systems and the learning multilayer neural networks. It focuses on contemporary deep networks with nonconvex nonsmooth loss and activation functions. The machine learning problems are treated as nonconvex nonsmooth stochastic optimization problems. As a model of nonsmooth nonconvex dependences, the so-called generalized … Read more

Substantiation of the Backpropagation Technique via the Hamilton-Pontryagin Formalism for Training Nonconvex Nonsmooth Neural Networks

Published: 2019/09/19

Vladimir I. Norkin

Convex and Nonsmooth Optimization, Nonlinear Optimization, Stochastic Programming deep learning, machine learning, multilayer neural networks, nonsmooth nonconvex optimization, stochastic generalized gradient, stochastic optimization

The paper observes the similarity between the stochastic optimal control of discrete dynamical systems and the training multilayer neural networks. It focuses on contemporary deep networks with nonconvex nonsmooth loss and activation functions. In the paper, the machine learning problems are treated as nonconvex nonsmooth stochastic optimization problems. As a model of nonsmooth nonconvex dependences, … Read more