Optimization in Data Science – Page 5

Multilevel Objective-Function-Free Optimization with an Application to Neural Networks Training

Published: 2023/02/14

Categories: Data Science Algorithms, Nonlinear Optimization, Nonlinear Systems and Least-Squares. Keywords: evaluation complexity, multilevel optimization, neural network training, OFFO methods

A class of multi-level algorithms for unconstrained nonlinear optimization is presented which does not require the evaluation of the objective function. The class contains the momentum-less AdaGrad method as a particular (single-level) instance. The choice of avoiding the evaluation of the objective function is intended to make the algorithms of the class less sensitive to …
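The single-level instance named in this abstract, momentum-less AdaGrad, can be sketched in a few lines. This is only an illustration of an objective-function-free update (gradients are used, function values never are), not the multilevel method of the paper; the function name `adagrad_offo`, the step size, and the quadratic test problem are assumptions made here.

```python
import math

def adagrad_offo(grad, x0, lr=1.0, eps=1e-8, steps=500):
    """Momentum-less AdaGrad: uses gradients only, never the objective value."""
    x = list(x0)
    accum = [0.0] * len(x)  # running sum of squared gradient components
    for _ in range(steps):
        g = grad(x)
        for i, gi in enumerate(g):
            accum[i] += gi * gi
            # Per-coordinate step scaled by the accumulated gradient magnitude.
            x[i] -= lr * gi / (math.sqrt(accum[i]) + eps)
    return x

# Hypothetical test problem: f(x) = 0.5 * (x0^2 + 10 * x1^2); only its gradient is coded.
def quad_grad(x):
    return [x[0], 10.0 * x[1]]

x_final = adagrad_offo(quad_grad, [3.0, -2.0])
```

Because no function value is ever compared, there is no line search or acceptance test; the step length is determined entirely by the gradient history.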

A classification method based on a cloud of spheres

Published: 2023/02/05, Updated: 2023/08/22

Tiago Dias

Paula Alexandra Amaral

Categories: (Mixed) Integer Nonlinear Programming, Data Science Algorithms, Quadratic Programming. Keywords: classification in machine learning, MINLP, Spherical separation, supervised classification

In this article we propose a binary classification model to distinguish a specific class that corresponds to a characteristic that we intend to identify (fraud, spam, disease). The classification model is based on a cloud of spheres that circumscribes the points of the class to be identified. It is intended to build a model …
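One plausible way to use such a cloud of spheres at prediction time is to flag a point as belonging to the target class when it falls inside at least one sphere. The abstract does not state the decision rule, so the rule below, the helper `in_cloud`, and the sample spheres are all assumptions for illustration; how the spheres are fitted is not shown.

```python
import math

def in_cloud(point, spheres):
    """Return True if `point` lies inside at least one (center, radius) sphere."""
    return any(math.dist(point, center) <= radius for center, radius in spheres)

# Hypothetical cloud of two spheres in the plane.
spheres = [((0.0, 0.0), 1.0), ((3.0, 0.0), 0.5)]
labels = [in_cloud(p, spheres) for p in [(0.5, 0.5), (3.2, 0.1), (2.0, 2.0)]]
```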

Analyzing Inexact Hypergradients for Bilevel Learning

Published: 2023/01/12

Matthias J. Ehrhardt

Lindon Roberts

Categories: Nonlinear Optimization, Optimization in Data Science. Keywords: automatic differentiation, bilevel optimization, hyperparameter optimization

Estimating hyperparameters has been a long-standing problem in machine learning. We consider the case where the task at hand is modeled as the solution to an optimization problem. Here the exact gradient with respect to the hyperparameters cannot be feasibly computed and approximate strategies are required. We introduce a unified framework for computing hypergradients that …
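To make the notion of an inexact hypergradient concrete, here is a minimal sketch on a one-dimensional ridge regression, where the lower-level problem is solved only approximately by a few gradient steps and the hypergradient follows from the implicit function theorem. The toy data, the function names, and the damped step size are assumptions of this sketch; it is not the framework proposed in the paper.

```python
def lower_grad(x, lam, train):
    """Gradient in x of the lower-level loss 0.5*sum((a*x - b)^2) + 0.5*lam*x^2."""
    return sum(a * (a * x - b) for a, b in train) + lam * x

def upper_grad(x, val):
    """Gradient in x of the validation loss 0.5*sum((c*x - d)^2)."""
    return sum(c * (c * x - d) for c, d in val)

def inexact_hypergradient(lam, train, val, inner_steps):
    """Approximate d(validation loss)/d(lam) from an inexact lower-level solve."""
    hess = sum(a * a for a, _ in train) + lam  # scalar lower-level Hessian
    x = 0.0
    for _ in range(inner_steps):
        # Damped gradient step: the error to the exact minimizer halves each time.
        x -= 0.5 * lower_grad(x, lam, train) / hess
    # Implicit function theorem: hess * dx/dlam + x = 0, so dx/dlam = -x / hess.
    dx_dlam = -x / hess
    return upper_grad(x, val) * dx_dlam

# Hypothetical training and validation pairs (feature, target).
train = [(1.0, 2.0), (2.0, 3.9), (3.0, 6.1)]
val = [(1.5, 3.2), (2.5, 4.8)]
coarse = inexact_hypergradient(0.1, train, val, inner_steps=2)
fine = inexact_hypergradient(0.1, train, val, inner_steps=50)
```

The gap between `coarse` and `fine` shows how the accuracy of the lower-level solve propagates into the hypergradient.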

A Levenberg-Marquardt Method for Nonsmooth Regularized Least Squares

Published: 2023/01/06

Dominique Orban

Aleksandr Y. Aravkin

Robert Baraldi

Categories: Data Science Algorithms, Nonlinear Systems and Least-Squares, Nonsmooth Optimization. Keywords: levenberg-marquardt method, nonconvex optimization, nonlinear least squares, nonsmooth optimization, proximal gradient method, regularized optimization

We develop a Levenberg-Marquardt method for minimizing the sum of a smooth nonlinear least-squares term \(f(x) = \frac{1}{2} \|F(x)\|_2^2\) and a nonsmooth term \(h\). Both \(f\) and \(h\) may be nonconvex. Steps are computed by minimizing the sum of a regularized linear least-squares model and a model of \(h\) using a first-order method such …
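For orientation, the problem and a generic step subproblem of this kind can be written as follows. The Jacobian \(J\), the regularization parameter \(\sigma_k > 0\), and the model \(\psi(\cdot; x_k)\) of \(h\) are notation assumed here and do not appear in the abstract; the exact form used in the paper may differ.

\[
\min_{x} \; f(x) + h(x), \qquad f(x) = \tfrac{1}{2} \|F(x)\|_2^2,
\]

\[
s_k \in \arg\min_{s} \; \tfrac{1}{2} \|F(x_k) + J(x_k)\, s\|_2^2 + \tfrac{1}{2} \sigma_k \|s\|_2^2 + \psi(s; x_k), \qquad \psi(s; x_k) \approx h(x_k + s).
\]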

Data-driven Prediction of Relevant Scenarios for Robust Combinatorial Optimization

Published: 2023/01/05

Marc Goerigk

Jannis Kurtz

Categories: Optimization in Data Science, Robust Optimization. Keywords: column-and-constraint generation method, machine learning, Scenario Prediction, two-stage robust optimization

We study iterative methods for (two-stage) robust combinatorial optimization problems with discrete uncertainty. We propose a machine-learning-based heuristic to determine starting scenarios that provide strong lower bounds. To this end, we design dimension-independent features and train a Random Forest Classifier on small-dimensional instances. Experiments show that our method improves the solution process for larger instances …

A mixed-integer exponential cone programming formulation for feature subset selection in logistic regression

Published: 2022/12/22

Sahand Asgharieh Ahari

Burak Kocuk

Categories: Optimization in Data Science. Keywords: classification, machine learning, mixed-integer exponential cone programming, sparse logistic regression

Logistic regression is one of the widely-used classification tools to construct prediction models. For datasets with a large number of features, feature subset selection methods are considered to obtain accurate and interpretable prediction models, in which irrelevant and redundant features are removed. In this paper, we address the problem of feature subset selection in logistic …

Expected Value of Matrix Quadratic Forms with Wishart distributed Random Matrices

Published: 2022/12/02, Updated: 2022/12/13

Melinda Hagedorn

Categories: Convex Optimization, Data Science Theory, Stochastic Approaches. Keywords: averaging, expected value, quadratic form, second momentum, stochastic gradient method, Wishart distribution

To explore the limits of a stochastic gradient method, it may be useful to consider an example consisting of an infinite number of quadratic functions. In this context, it is appropriate to determine the expected value and the covariance matrix of the stochastic noise, i.e. the difference of the true gradient and the approximated gradient …

Deep learning and hyperparameter optimization for assessing one’s eligibility for a subcutaneous implantable cardioverter-defibrillator

Published: 2022/11/21

Alain Zemkoho

Categories: Applications - OR and Management Sciences, Applications - Science and Engineering, Optimization in Data Science. Keywords: deep learning, machine learning, optimization, subcutaneous implantable cardioverter defibrillators

In cardiology, it is standard for patients suffering from ventricular arrhythmias (the leading cause of sudden cardiac death) belonging to high risk populations to be treated using Subcutaneous Implantable Cardioverter-Defibrillators (S-ICDs). S-ICDs carry a risk of so-called T Wave Over Sensing (TWOS), which can lead to inappropriate shocks with an inherent health risk. For this …

A polyhedral study of multivariate decision trees

Published: 2022/11/14

Carla Michini

Zachary Zhou

Categories: (Mixed) Integer Linear Programming, Data Science Theory, Polyhedra. Keywords: facet-defining inequality, mixed-integer programming, optimal decision tree

Decision trees are a widely used tool for interpretable machine learning. Multivariate decision trees employ hyperplanes at the branch nodes to route datapoints throughout the tree and yield more compact models than univariate trees. Recently, mixed-integer programming (MIP) has been applied to formulate the optimal decision tree problem. To strengthen MIP formulations, it is crucial …
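The routing rule described here, a hyperplane test at each branch node, can be illustrated with a small sketch. The dictionary layout, the convention that a point goes left when \(w^\top x \le b\), and the example tree are assumptions of this sketch; the MIP formulation that would choose the hyperplanes is not shown.

```python
def route(x, node):
    """Follow hyperplane splits from `node` down to a leaf and return its label."""
    while "label" not in node:
        score = sum(w * xi for w, xi in zip(node["w"], x))
        # Assumed convention: left branch when w.x <= b, right branch otherwise.
        node = node["left"] if score <= node["b"] else node["right"]
    return node["label"]

# Hypothetical depth-2 tree over two features.
tree = {
    "w": [1.0, 1.0], "b": 1.0,
    "left": {"label": 0},
    "right": {
        "w": [1.0, -1.0], "b": 0.0,
        "left": {"label": 1},
        "right": {"label": 2},
    },
}
preds = [route(p, tree) for p in [(0.2, 0.3), (1.0, 2.0), (3.0, 1.0)]]
```

A univariate tree is the special case in which each weight vector `w` has a single nonzero entry.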

Inexact Proximal-Gradient Methods with Support Identification

Published: 2022/11/03

Yutong Dai

Daniel P. Robinson

Categories: Convex and Nonsmooth Optimization, Data Science Algorithms. Keywords: inexact proximal-gradient method, overlapping group regularizer, structured sparsity, support identification, worst-case iteration-complexity

We consider the proximal-gradient method for minimizing an objective function that is the sum of a smooth function and a non-smooth convex function. A feature that distinguishes our work from most in the literature is that we assume that the associated proximal operator does not admit a closed-form solution. To address this challenge, we …
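As a point of reference, the classical exact proximal-gradient iteration looks like the sketch below. It deliberately uses an \(\ell_1\) regularizer, whose proximal operator does have a closed form (soft-thresholding), so it illustrates the baseline method and the idea of a support, not the inexact setting studied in the paper. The function names and the toy smooth term are assumptions.

```python
def soft_threshold(v, t):
    """Closed-form proximal operator of t * |.| applied to a scalar."""
    if v > t:
        return v - t
    if v < -t:
        return v + t
    return 0.0

def proximal_gradient(grad_f, x0, step, reg, iters=200):
    """Iterate x <- prox_{step*reg*||.||_1}(x - step * grad_f(x))."""
    x = list(x0)
    for _ in range(iters):
        g = grad_f(x)
        x = [soft_threshold(xi - step * gi, step * reg) for xi, gi in zip(x, g)]
    return x

# Hypothetical smooth part: f(x) = 0.5 * ||x - c||^2 with c = (3.0, 0.2, -1.5).
c = [3.0, 0.2, -1.5]
x_star = proximal_gradient(lambda x: [xi - ci for xi, ci in zip(x, c)],
                           [0.0, 0.0, 0.0], step=1.0, reg=0.5)
# Support: indices of the nonzero entries of the computed solution.
support = [i for i, xi in enumerate(x_star) if xi != 0.0]
```

When the proximal operator has no closed form, the `soft_threshold` call would be replaced by an inner solver run to some tolerance, which is the situation the abstract addresses.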