Thiago Serra – Optimization Online

When Deep Learning Meets Polyhedral Theory: A Survey

Published: 2023/04/29, Updated: 2023/08/31

(Mixed) Integer Linear Programming, Optimization in Data Science, Polyhedra

In the past decade, deep learning became the prevalent methodology for predictive modeling thanks to the remarkable accuracy of deep neural networks in tasks such as computer vision and natural language processing. Meanwhile, the structure of neural networks converged back to simpler representations based on piecewise constant and piecewise linear functions such as the Rectified …

The Combinatorial Brain Surgeon: Pruning Weights That Cancel One Another in Neural Networks

Published: 2022/03/10

Data-Mining, Meta Heuristics, Quadratic Programming

loss function, network pruning, neural networks

Neural networks tend to achieve better accuracy with training if they are larger, even if the resulting models are overparameterized. Nevertheless, carefully removing such excess parameters before, during, or after training may also produce models with similar or even improved accuracy. In many cases, that can be curiously achieved by heuristics as simple as …

Scaling Up Exact Neural Network Compression by ReLU Stability

Published: 2021/02/16

Abhinav Kumar

Srikumar Ramalingam

Thiago Serra

(Mixed) Integer Linear Programming, Applications - Science and Engineering, Combinatorial Optimization

We can compress a neural network while exactly preserving its underlying functionality with respect to a given input domain if some of its neurons are stable. However, current approaches to determine the stability of neurons in networks with Rectified Linear Unit (ReLU) activations require solving or finding a good approximation to multiple discrete optimization problems. …

Lossless Compression of Deep Neural Networks

Published: 2020/01/01, Updated: 2020/01/24

Abhinav Kumar

Srikumar Ramalingam

Thiago Serra

(Mixed) Integer Linear Programming, Applications - Science and Engineering

Deep neural networks have been successful in many predictive modeling tasks, such as image and language recognition, where large neural networks are often used to obtain good accuracy. Consequently, it is challenging to deploy these networks under limited computational resources, such as in mobile devices. In this work, we introduce an algorithm that removes units …

Template-based Minor Embedding for Adiabatic Quantum Optimization

Published: 2019/10/09, Updated: 2021/01/25

David Bergman

Arvind U. Raghunathan

Thiago Serra

Teng Huang

0-1 Programming, Applications - Science and Engineering

Quantum Annealing (QA) can be used to quickly obtain near-optimal solutions for Quadratic Unconstrained Binary Optimization (QUBO) problems. In QA hardware, each decision variable of a QUBO should be mapped to one or more adjacent qubits in such a way that pairs of variables defining a quadratic term in the objective function are mapped to …

Empirical Bounds on Linear Regions of Deep Rectifier Networks

Published: 2018/10/09, Updated: 2020/01/24

Srikumar Ramalingam

Thiago Serra

(Mixed) Integer Linear Programming, Statistics

One form of characterizing the expressiveness of a piecewise linear neural network is by the number of linear regions, or pieces, of the function modeled. We have observed substantial progress in this topic through lower and upper bounds on the maximum number of linear regions and a counting procedure. However, these bounds only account for …

When Lift-and-Project Cuts are Different

Published: 2018/09/17, Updated: 2020/01/24

Egon Balas

Thiago Serra

Cutting Plane Approaches

In this paper, we present a method to determine if a lift-and-project cut for a mixed-integer linear program is regular, in which case the cut is equivalent to an intersection cut from some basis of the linear relaxation. This is an important question due to the intense research activity for the past decade on cuts …

Seamless Multimodal Transportation Scheduling

Published: 2018/07/26

David Bergman

John Hooker

Arvind U. Raghunathan

Thiago Serra

Shingo Kobori

Applications - OR and Management Sciences, Scheduling, Transportation

Ride-hailing services have expanded the role of shared mobility in passenger transportation systems, creating new markets and creative planning solutions for major urban centers. In this paper, we consider their use for last-mile passenger transportation in coordination with a mass transit service to provide a seamless multimodal transportation experience for the user. A system that …

The Integrated Last-Mile Transportation Problem

Published: 2018/06/21

David Bergman

John Hooker

Arvind U. Raghunathan

Thiago Serra

(Mixed) Integer Linear Programming, Scheduling

Last-mile transportation (LMT) refers to any service that moves passengers from a hub of mass transportation (MT), such as air, boat, bus, or train, to destinations, such as a home or an office. In this paper, we introduce the problem of scheduling passengers jointly on MT and LMT services, with passengers sharing a car, van, …

Bounding and Counting Linear Regions of Deep Neural Networks

Published: 2018/01/08, Updated: 2018/06/09

Srikumar Ramalingam

Thiago Serra

Christian Tjandraatmadja

(Mixed) Integer Linear Programming, Statistics

deep learning, linear regions, mixed-integer linear programming, piecewise-linear activations, solution counting

We investigate the complexity of deep neural networks (DNN) that represent piecewise linear (PWL) functions. In particular, we study the number of linear regions, i.e., pieces, that a PWL function represented by a DNN can attain, both theoretically and empirically. We present (i) tighter upper and lower bounds for the maximum number of linear regions …