Data-Mining – Page 2 – Optimization Online

The Combinatorial Brain Surgeon: Pruning Weights That Cancel One Another in Neural Networks

Published: 2022/03/10

Data-Mining, Meta Heuristics, Quadratic Programming loss function, network pruning, neural networks

Neural networks tend to achieve better accuracy with training if they are larger — even if the resulting models are overparameterized. Nevertheless, carefully removing such excess parameters before, during, or after training may also produce models with similar or even improved accuracy. In many cases, that can be curiously achieved by heuristics as simple as … Read more

Mixed-Integer Programming Techniques for the Minimum Sum-of-Squares Clustering Problem

Published: 2022/03/09, Updated: 2022/11/29

(Mixed) Integer Nonlinear Programming, Cutting Plane Approaches, Data-Mining Computational Techniques, global optimization, Minimum Sum-of-Squares Clustering, Mixed-integer nonlinear optimization

The minimum sum-of-squares clustering problem is a very important problem in data mining and machine learning with very many applications in, e.g., medicine or social sciences. However, it is known to be NP-hard in all relevant cases and to be notoriously hard to be solved to global optimality in practice. In this paper, we develop … Read more

Kernel Probabilistic Distance Clustering

Published: 2022/02/07

Dilay Ozkan

Cem Iyigun

Constrained Nonlinear Optimization, Data-Mining clustering, kernel functions, probabilistic distance clustering

CitationDepartment of Industrial Engineering, Middle East Technical University, Ankara, TurkeyArticleDownload View PDF

Stable Recovery of Sparse Signals With Non-convex Weighted $r$-Norm Minus $1$-Norm

Published: 2022/01/09

Constrained Nonlinear Optimization, Data-Mining, Nonsmooth Optimization compressed sensing, mutual coherence, sparse recovery, sufficient condition

Given the measurement matrix $A$ and the observation signal $y$, the central purpose of compressed sensing is to find the most sparse solution of the underdetermined linear system $y=Ax+z$, where $x$ is the $s$-sparse signal to be recovered and $z$ is the noise vector. Zhou and Yu \cite{Zhou and Yu 2019} recently proposed a novel … Read more

Time-series aggregation for the optimization of energy systems: goals, challenges, approaches, and opportunities

Published: 2022/01/02

Holger Teichgraeber

Adam R. Brandt

Data-Mining, Energy, Facility Planning and Design clustering, energy, optimization, Representative periods, Review, Typical days

The rising significance of renewable energy increases the importance of representing time-varying input data in energy system optimization studies. Time-series aggregation, which reduces temporal model complexity, has emerged in recent years to address this challenge. We provide a comprehensive review of time-series aggregation for the optimization of energy systems. We show where time series affect … Read more

New interior-point approach for one- and two-class linear support vector machines using multiple variable splitting

Published: 2021/12/17

Jordi Castro

Data-Mining, Linear, Cone and Semidefinite Programming interior point methods, large-scale optimization, multiple variable splitting, one-class support vector machine, support vector classifier

Multiple variable splitting is a general technique for decomposing problems by using copies of variables and additional linking constraints that equate their values. The resulting large optimization problem can be solved with a specialized interior-point method that exploits the problem structure and computes the Newton direction with a combination of direct and iterative solvers (i.e., … Read more

Analysis non-sparse recovery for non-convex relaxed $\ell_q$ minimization

Published: 2021/11/29

Data-Mining, Nonsmooth Optimization, Unconstrained Optimization $\ell_q$ robust $D$-Null Space Property, compressed sensing, Non-convex relaxed $\ell_q$ minimization method, Restricted isometry property adapted $D$, sparse recovery

This paper studies construction of signals, which are sparse or nearly sparse with respect to a tight frame $D$ from underdetermined linear systems. In the paper, we propose a non-convex relaxed $\ell_q(0 ArticleDownload View PDF

Mixed-Integer Optimization with Constraint Learning

Published: 2021/11/09

(Mixed) Integer Linear Programming, Applications - OR and Management Sciences, Data-Mining Constraint learning, machine learning, mixed integer optimization, prescriptive analytics

We establish a broad methodological foundation for mixed-integer optimization with learned constraints. We propose an end-to-end pipeline for data-driven decision making in which constraints and objectives are directly learned from data using machine learning, and the trained models are embedded in an optimization formulation. We exploit the mixed-integer optimization-representability of many machine learning methods, including … Read more

Inexact bilevel stochastic gradient methods for constrained and unconstrained lower-level problems

Published: 2021/10/01, Updated: 2022/12/07

Tommaso Giovannelli

Griffin D. Kent

Luis Nunes Vicente

Data-Mining, Nonlinear Optimization, Stochastic Programming bilevel optimization, DARTS, machine learning, stochastic gradient descent

Two-level stochastic optimization formulations have become instrumental in a number ofmachine learning contexts such as continual learning, neural architecture search, adversariallearning, and hyperparameter tuning. Practical stochastic bilevel optimization problemsbecome challenging in optimization or learning scenarios where the number of variables ishigh or there are constraints. In this paper, we introduce a bilevel stochastic gradient method … Read more

Sparse Plus Low Rank Matrix Decomposition: A Discrete Optimization Approach

Published: 2021/09/30, Updated: 2023/11/13

Dimitris Bertsimas

Ryan Cory-Wright

Nicholas A. G. Johnson

Applications - OR and Management Sciences, Data-Mining, Statistics Branch-and-Bound, convex relaxation, matrix decomposition, rank, sparsity

We study the Sparse Plus Low-Rank decomposition problem (SLR), which is the problem of decomposing a corrupted data matrix into a sparse matrix of perturbations plus a low-rank matrix containing the ground truth. SLR is a fundamental problem in Operations Research and Machine Learning which arises in various applications, including data compression, latent semantic indexing, … Read more