Retrospective Approximation Sequential Quadratic Programming for Stochastic Optimization with General Deterministic Nonlinear Constraints

In this paper, we propose a framework based on the Retrospective Approximation (RA) paradigm to solve optimization problems with a stochastic objective function and general nonlinear deterministic constraints. This framework sequentially constructs increasingly accurate approximations of the true problem, which are solved to a specified accuracy via a deterministic solver, thereby decoupling the uncertainty from …
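
The abstract is truncated here, but the RA loop it describes can be sketched concretely: solve a sequence of sample-average approximations with growing sample sizes and shrinking solver tolerances, warm-starting each subproblem at the previous solution. The toy objective, the schedules, and the use of SciPy's SLSQP as the deterministic solver below are illustrative assumptions, not the paper's configuration.

```python
# A minimal Retrospective Approximation (RA) sketch, assuming a sample-average
# approximation of the stochastic objective and SciPy's SLSQP as the
# deterministic solver; the sample-size and tolerance schedules are illustrative.
import numpy as np
from scipy.optimize import minimize

rng = np.random.default_rng(0)

def saa_objective(x, xi):
    # Sample-average approximation of E[ ||x - xi||^2 ] over the drawn scenarios.
    return np.mean(np.sum((x - xi) ** 2, axis=1))

def constraint(x):
    # Deterministic nonlinear constraint x1^2 + x2^2 <= 1, written as >= 0 for SLSQP.
    return 1.0 - x[0] ** 2 - x[1] ** 2

x = np.array([0.0, 0.0])
n_samples, tol = 100, 1e-2
for k in range(5):
    xi = rng.normal(loc=[0.8, 0.8], scale=0.5, size=(n_samples, 2))
    res = minimize(saa_objective, x, args=(xi,), method="SLSQP",
                   constraints=[{"type": "ineq", "fun": constraint}],
                   options={"ftol": tol})
    x = res.x          # warm-start the next, more accurate subproblem
    n_samples *= 4     # larger sample: more accurate approximation of the true problem
    tol /= 10          # tighter deterministic-solver accuracy
print(x)
```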

Responsible Machine Learning via Mixed-Integer Optimization

In the last few decades, Machine Learning (ML) has achieved significant success across domains ranging from healthcare, sustainability, and the social sciences to criminal justice and finance. But its deployment in increasingly sophisticated, critical, and sensitive areas affecting individuals, the groups they belong to, and society as a whole raises serious concerns around fairness, transparency …

Fast Stochastic Second-Order Adagrad for Nonconvex Bound-Constrained Optimization

ADAGB2, a generalization of the Adagrad algorithm for stochastic optimization, is introduced; it is also applicable to bound-constrained problems and capable of using second-order information when available. It is shown that, given \(\delta \in (0,1)\) and \(\epsilon \in (0,1]\), the ADAGB2 algorithm needs at most \(O(\epsilon^{-2})\) iterations to ensure an \(\epsilon\)-approximate first-order critical point of …
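
As a point of reference for the setting (stochastic gradients plus bound constraints), here is a minimal projected diagonal-Adagrad sketch. It illustrates the problem class only; the actual ADAGB2 update, which can additionally exploit second-order information, is not reproduced here.

```python
# A minimal sketch of projected stochastic Adagrad on a box [lo, hi], assuming
# diagonal accumulation and Euclidean projection onto the bounds; this is an
# illustration of the setting, not the ADAGB2 algorithm itself.
import numpy as np

rng = np.random.default_rng(0)
lo, hi = np.array([-1.0, 0.0]), np.array([1.0, 2.0])   # bound constraints
x = np.zeros(2)
acc = np.zeros(2)                                      # accumulated squared gradients
eta, eps = 0.5, 1e-8

def stochastic_grad(x):
    # Noisy gradient of f(x) = ||x - (0.5, 1.5)||^2 / 2.
    return (x - np.array([0.5, 1.5])) + 0.1 * rng.normal(size=2)

for t in range(1000):
    g = stochastic_grad(x)
    acc += g ** 2
    x = x - eta * g / np.sqrt(acc + eps)   # Adagrad per-coordinate scaling
    x = np.clip(x, lo, hi)                 # project back onto the box
print(x)   # approaches the constrained minimizer (0.5, 1.5)
```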

Deterministic global optimization with trained neural networks: Is the envelope of single neurons worth it?

Optimization problems containing trained neural networks remain challenging due to their nonconvexity. Deterministic global optimization relies on relaxations that should be tight, quickly convergent, and cheap to evaluate. While envelopes of common activation functions have been established for several years, the envelope of an entire neuron had not been. Recently, Carrasco and Muñoz (arXiv:2410.23362, 2024) proposed …
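
For context, the single-activation envelope that whole-neuron results are measured against is the classic "triangle" relaxation of the ReLU over an interval; the sketch below states this standard fact (it is background, not taken from the paper).

```latex
% The classic single-activation ("triangle") relaxation, for an input
% interval [l, u] with l < 0 < u: max(0, x) is convex, so its convex
% envelope is the function itself, while its concave envelope is the
% secant through (l, 0) and (u, u). Together:
\[
  \max(0, x) \;\le\; y \;\le\; \frac{u\,(x - l)}{u - l},
  \qquad x \in [l, u].
\]
% Whole-neuron envelopes instead relax the composition max(0, w^\top x + b)
% over a box jointly, rather than activation by activation.
```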

Rank-one convexification for convex quadratic optimization with step function penalties

We investigate convexification in convex quadratic optimization with step function penalties. Such problems can be cast as mixed-integer quadratic optimization problems, where binary variables are used to encode the non-convex step function. First, we derive the convex hull for the epigraph of a quadratic function defined by a rank-one matrix. Using this rank-one convexification, we …
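
A hedged sketch of the encoding described above, assuming the step penalty is the indicator \(c_i \cdot 1[x_i \neq 0]\) and that a valid bound \(M\) on \(|x_i|\) is available (both assumptions for illustration; the paper's exact penalty form is truncated in this excerpt):

```latex
% Sketch of the MIQO encoding with binary on/off variables z:
\[
  \min_{x,\,z}\; x^\top Q x + \sum_i c_i z_i
  \quad \text{s.t.} \quad |x_i| \le M z_i, \quad z_i \in \{0,1\},
\]
% so z_i = 1 whenever x_i is nonzero. In the rank-one case studied in the
% abstract, Q = a a^\top, i.e. x^\top Q x = (a^\top x)^2, and the object of
% interest is the convex hull of the epigraph \{(x, z, t) : t \ge (a^\top x)^2\}
% together with the on/off logic above.
```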

A Foundational Perspective for Partitional Clustering on Networks

This study presents a theoretical analysis of partitional clustering on networks. Different versions of the problem are studied, considering different assignment schemes (hard and soft) and different objective functions. Cluster centers are not restricted to the set of nodes; it is assumed that centers can also lie on the edges of the network. Four …
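
For concreteness, one way to write the two assignment schemes on a network uses shortest-path distances \(d(\cdot,\cdot)\), centers allowed on nodes or edge interiors, and a fuzziness exponent \(m > 1\) for the soft scheme; these forms are illustrative assumptions, since the paper's exact objectives are truncated here.

```latex
% Hard vs. soft assignment on a network G = (V, E), with shortest-path
% distance d and centers c_1, ..., c_k on nodes or edge interiors:
\[
  \text{hard:}\;\; \min_{c_1,\dots,c_k} \sum_{v \in V} \min_{1 \le j \le k} d(v, c_j),
  \qquad
  \text{soft:}\;\; \min_{c,\,w} \sum_{v \in V} \sum_{j=1}^{k} w_{vj}^{\,m}\, d(v, c_j)
  \;\;\text{s.t.}\;\; \sum_{j=1}^{k} w_{vj} = 1,\; w_{vj} \ge 0.
\]
```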

Mathematical programs with complementarity constraints and application to hyperparameter tuning for nonlinear support vector machines

We consider the Mathematical Program with Complementarity Constraints (MPCC). One of the main challenges in solving this problem is the systematic failure of standard Constraint Qualifications (CQs). Carefully accounting for the combinatorial nature of the complementarity constraints, tractable versions of the Mangasarian-Fromovitz Constraint Qualification (MFCQ) have been designed and widely studied in the literature. …
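
The standard MPCC form behind the abstract reads as follows; the note on why MFCQ fails is the well-known textbook argument, included for context.

```latex
% The standard MPCC form:
\[
  \min_x\; f(x)
  \quad\text{s.t.}\quad g(x) \le 0,\;\; h(x) = 0,\;\;
  0 \le G(x) \,\perp\, H(x) \ge 0,
\]
% where complementarity means G_i(x) >= 0, H_i(x) >= 0, and
% G_i(x) H_i(x) = 0 for every i. At each feasible point every pair
% (G_i, H_i) has an active component, and it is well known that MFCQ
% then fails at every feasible point when complementarity is written in
% this nonlinear-programming form: the systematic failure mentioned above.
```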

Tight Semidefinite Relaxations for Verifying Robustness of Neural Networks

For verifying the safety of neural networks (NNs), Fazlyab et al. (2019) introduced a semidefinite programming (SDP) approach called DeepSDP. This formulation can be viewed as the dual of the SDP relaxation for a problem formulated as a quadratically constrained quadratic program (QCQP). While SDP relaxations of QCQPs generally provide approximate solutions with some gaps, …
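
For context, here is a generic QCQP and its Shor SDP relaxation, the kind of relaxation the abstract refers to; the specific DeepSDP matrices are not reproduced here.

```latex
% A generic QCQP:
\[
  \min_x\; x^\top A_0 x + 2 b_0^\top x
  \quad\text{s.t.}\quad x^\top A_i x + 2 b_i^\top x + c_i \le 0,\;\; i = 1,\dots,m.
\]
% Lifting X = x x^\top linearizes each quadratic as <A_i, X>; dropping the
% rank-one requirement on X yields the SDP relaxation
\[
  \min_{x,\,X}\; \langle A_0, X \rangle + 2 b_0^\top x
  \quad\text{s.t.}\quad \langle A_i, X \rangle + 2 b_i^\top x + c_i \le 0,\;\;
  \begin{pmatrix} 1 & x^\top \\ x & X \end{pmatrix} \succeq 0.
\]
% The relaxation is tight precisely when the optimal lifted matrix has rank
% one; the gaps mentioned above arise from dropping that rank constraint.
```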

An extrapolated and provably convergent algorithm for nonlinear matrix decomposition with the ReLU function

Nonlinear matrix decomposition (NMD) with the ReLU function, denoted ReLU-NMD, is the following problem: given a sparse, nonnegative matrix \(X\) and a factorization rank \(r\), identify a rank-\(r\) matrix \(\Theta\) such that \(X\approx \max(0,\Theta)\). This decomposition finds application in data compression, matrix completion with entries missing not at random, and manifold learning. The standard ReLU-NMD …
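
The algorithmic details are truncated, but the decomposition itself can be illustrated with the common "naive" alternating baseline for ReLU-NMD: fill the zeros of \(X\) with the negative part of the current \(\Theta\), then re-fit a rank-\(r\) matrix by truncated SVD. This is a minimal sketch of that baseline, not the authors' extrapolated method.

```python
# A minimal sketch of the naive alternating baseline for ReLU-NMD, assuming
# the usual update: on the support of X keep X itself, on the zero pattern use
# the negative part of the current Theta, then re-fit rank r by truncated SVD.
import numpy as np

def relu_nmd_naive(X, r, iters=200, seed=0):
    rng = np.random.default_rng(seed)
    Theta = rng.normal(size=X.shape)
    pos = X > 0                          # support of the observed matrix
    for _ in range(iters):
        Z = np.where(pos, X, np.minimum(Theta, 0.0))  # latent matrix estimate
        U, s, Vt = np.linalg.svd(Z, full_matrices=False)
        Theta = (U[:, :r] * s[:r]) @ Vt[:r]           # best rank-r fit to Z
    return Theta

rng = np.random.default_rng(1)
X = np.maximum(0.0, rng.normal(size=(30, 5)) @ rng.normal(size=(5, 20)))
Theta = relu_nmd_naive(X, r=5)
print(np.linalg.norm(X - np.maximum(0.0, Theta)) / np.linalg.norm(X))  # small residual
```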

A Rank-One-Update Method for the Training of Support Vector Machines

This paper considers convex quadratic programs associated with the training of support vector machines (SVM). Exploiting the special structure of the SVM problem, a new type of active-set method with long cycles and stable rank-one updates is proposed and tested (CMU: cycling method with updates). The structure of the problem allows for a repeated simple …
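
The excerpt stops before the details, but the flavor of a rank-one update in an active-set method can be illustrated generically: when the active set changes by one element, the working matrix changes by a rank-one term, and its inverse can be updated in O(n^2) via Sherman-Morrison rather than refactorized in O(n^3). The sketch below is this generic device, not the paper's CMU method.

```python
# Generic illustration of a rank-one update, assuming the working matrix
# changes as A -> A + u v^T when the active set changes by one element.
# The paper's CMU method maintains its own stable rank-one updates; this
# is the textbook Sherman-Morrison formula for comparison.
import numpy as np

def sherman_morrison(Ainv, u, v):
    # (A + u v^T)^{-1} = A^{-1} - (A^{-1} u)(v^T A^{-1}) / (1 + v^T A^{-1} u)
    Au = Ainv @ u
    vA = v @ Ainv
    return Ainv - np.outer(Au, vA) / (1.0 + v @ Au)

rng = np.random.default_rng(0)
n = 6
A = np.eye(n) + 0.1 * rng.normal(size=(n, n))
u, v = rng.normal(size=n), rng.normal(size=n)

Ainv = np.linalg.inv(A)
fast = sherman_morrison(Ainv, u, v)          # O(n^2) update
slow = np.linalg.inv(A + np.outer(u, v))     # O(n^3) refactorization
print(np.allclose(fast, slow))               # True
```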