Data-Mining – Page 6 – Optimization Online

Smart “Predict, then Optimize”

Published: 2017/12/31, Updated: 2019/07/24

Applications - OR and Management Sciences, Data-Mining, Statistics

Many real-world analytics problems involve two significant challenges: prediction and optimization. Due to the typically complex nature of each challenge, the standard paradigm is to predict, then optimize. By and large, machine learning tools are intended to minimize prediction error and do not account for how the predictions will be used in a downstream optimization … Read more

Approximate Positively Correlated Distributions and Approximation Algorithms for D-optimal Design

Published: 2017/12/08, Updated: 2018/01/15

Mohit Singh

Weijun Xie

Approximation Algorithms, Data-Mining approximation algorithms, convex relaxation, d-optimal design, statistics

Experimental design is a classical problem in statistics and has also found new applications in machine learning. In the experimental design problem, the aim is to estimate an unknown vector x in m-dimensions from linear measurements where a Gaussian noise is introduced in each measurement. The goal is to pick k out of the given … Read more

Forecasting Solar Flares using magnetogram-based predictors and Machine Learning

Published: 2017/12/08

Federico Benvenuto

Manolis K. Georgoulis

Basic Sciences Applications, Data-Mining, Statistics flares, magnetic fields

We propose a forecasting approach for solar flares based on data from Solar Cycle 24, taken by the Helioseismic and Magnetic Imager (HMI) on board the Solar Dynamics Observatory (SDO) mission. In particular, we use the Space-weather HMI Active Region Patches (SHARP) product that facilitates cut-out magnetograms of solar active regions (AR) in the Sun … Read more

Sparse principal component analysis and its l1-relaxation

Published: 2017/12/03

(Mixed) Integer Nonlinear Programming, Data-Mining

Principal component analysis (PCA) is one of the most widely used dimensionality reduction methods in scientific data analysis. In many applications, for additional interpretability, it is desirable for the factor loadings to be sparse, that is, we solve PCA with an additional cardinality (l0) constraint. The resulting optimization problem is called the sparse principal component … Read more

Weak Stability of $\ell_1hBcminimization Methods in Sparse Data Reconstruction

Published: 2017/11/04

H. Jiang

Zhi-Quan Luo

Yun-Bin Zhao

Convex and Nonsmooth Optimization, Data-Mining, Linear Programming $\ell_1hBcminimization, convex optimization, linear programming, sparsity optimization, weak range space property, weak stability

As one of the most plausible convex optimization methods for sparse data reconstruction, $\ell_1$-minimization plays a fundamental role in the development of sparse optimization theory. The stability of this method has been addressed in the literature under various assumptions such as restricted isometry property (RIP), null space property (NSP), and mutual coherence. In this paper, … Read more

Estimating L1-Norm Best-Fit Lines for Data

Published: 2017/06/07, Updated: 2019/08/15

J. Paul Brooks

JH Dulá

Applications - OR and Management Sciences, Data-Mining, Statistics analytics, l1-norm, line location, principal component analysis

The general formulation for finding the L1-norm best-fit subspace for a point set in $m$-dimensions is a nonlinear, nonconvex, nonsmooth optimization problem. In this paper we present a procedure to estimate the L1-norm best-fit one-dimensional subspace (a line through the origin) to data in $\Re^m$ based on an optimization criterion involving linear programming but which … Read more

Size Matters: Cardinality-Constrained Clustering and Outlier Detection via Conic Optimization

Published: 2017/05/22

Approximation Algorithms, Data-Mining, Linear, Cone and Semidefinite Programming k-means clustering, optimality guarantee, outlier detection, semidenite programming

Plain vanilla K-means clustering is prone to produce unbalanced clusters and suffers from outlier sensitivity. To mitigate both shortcomings, we formulate a joint outlier-detection and clustering problem, which assigns a prescribed number of datapoints to an auxiliary outlier cluster and performs cardinality-constrained K-means clustering on the residual dataset. We cast this problem as a mixed-integer … Read more

Decomposition Algorithms for Distributionally Robust Optimization using Wasserstein Metric

Published: 2017/04/06

Fengqiao Luo

Sanjay Mehrotra

Data-Mining, Robust Optimization, Semi-infinite Programming cutting-surface algorithms, distributionally robust optimization, exchange method, semi-infinite programming, wasserstein distance

We study distributionally robust optimization (DRO) problems where the ambiguity set is dened using the Wasserstein metric. We show that this class of DRO problems can be reformulated as semi-innite programs. We give an exchange method to solve the reformulated problem for the general nonlinear model, and a central cutting-surface method for the convex case, … Read more

Random Sampling and Machine Learning to Understand Good Decompositions

Published: 2017/03/25, Updated: 2017/05/19

Saverio Basso

Alberto Ceselli

Andrea Tettamanzi

(Mixed) Integer Linear Programming, Data-Mining dantzig-wolfe decomposition, machine learning, random sampling

Motivated by its implications in the development of general purpose solvers for decomposable Mixed Integer Programs (MIP), we address a fundamental research question, that is to assess if good decomposition patterns can be consistently found by looking only at static properties of MIP input instances, or not. We adopt a data driven approach, devising a … Read more

Optimization Algorithms for Data Analysis

Published: 2016/12/01, Updated: 2019/02/21

Stephen Wright

Data-Mining, Nonsmooth Optimization, Unconstrained Optimization data analysis, optimization

We describe the fundamentals of algorithms for minimizing a smooth nonlinear function, and extensions of these methods to the sum of a smooth function and a convex nonsmooth function. Such objective functions are ubiquitous in data analysis applications, as we illustrate using several examples. We discuss methods that make use of gradient (first-order) information about … Read more