Data-Mining – Page 4 – Optimization Online

Best Principal Submatrix Selection for the Maximum Entropy Sampling Problem: Scalable Algorithms and Performance Guarantees

Published: 2020/01/23, Updated: 2023/05/01

This paper studies a classic maximum entropy sampling problem (MESP), which aims to select the most informative principal submatrix with a given size out of a covariance matrix from a system. MESP has been widely applied to many areas, including healthcare, power systems, manufacturing, and data science. Investigating its Lagrangian dual and primal characterization, we …
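For a concrete view of the objective: MESP is commonly stated as choosing an index set S of the given size that maximizes the log-determinant of the principal submatrix C[S, S]. The sketch below is a brute-force illustration of that objective only, not the scalable algorithms the paper develops. The function names `log_det_spd` and `best_principal_submatrix` are hypothetical, and the covariance matrix is assumed to be symmetric positive definite.

```python
from itertools import combinations
from math import log


def log_det_spd(m):
    """Log-determinant of a symmetric positive definite matrix via Cholesky."""
    n = len(m)
    chol = [[0.0] * n for _ in range(n)]
    total = 0.0
    for i in range(n):
        for j in range(i + 1):
            s = m[i][j] - sum(chol[i][k] * chol[j][k] for k in range(j))
            if i == j:
                if s <= 0:
                    raise ValueError("matrix is not positive definite")
                chol[i][j] = s ** 0.5
                # det = prod(chol[i][i] ** 2), so log det = sum(log(s)).
                total += log(s)
            else:
                chol[i][j] = s / chol[j][j]
    return total


def best_principal_submatrix(cov, size):
    """Enumerate all index sets of the given size (exponential cost) and
    return the one whose principal submatrix has the largest log-determinant."""
    def objective(idx):
        return log_det_spd([[cov[i][j] for j in idx] for i in idx])

    return list(max(combinations(range(len(cov)), size), key=objective))


# Illustrative data (an assumption, not from the paper): a diagonal covariance.
cov = [[2.0, 0.0, 0.0],
       [0.0, 1.0, 0.0],
       [0.0, 0.0, 3.0]]
selected = best_principal_submatrix(cov, 2)
```

Enumeration is only practical for very small matrices, which is why approximation algorithms with performance guarantees are of interest for this problem.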

The perturbation analysis of nonconvex low-rank matrix robust recovery

Published: 2019/12/25

Jianwen Huang

Wang Wendong

Feng Zhang

Applications - Science and Engineering, Data-Mining

In this paper, we propose a completely perturbed nonconvex Schatten $p$-minimization to address a model of completely perturbed low-rank matrix recovery. Based on the restricted isometry property, the paper generalizes the investigation to a complete perturbation model that accounts not only for noise but also for perturbation, and gives the restricted isometry property condition that guarantees …

Online matrix factorization for Markovian data and applications to Network Dictionary Learning

Published: 2019/11/11

Laura Balzano

Hanbaek Lyu

Deanna Needell

Data-Mining, Nonlinear Optimization

convergence analysis, dictionary learning, markovian data, non-negative matrix factorization, online matrix factorization

Online Matrix Factorization (OMF) is a fundamental tool for dictionary learning problems, giving an approximate representation of complex data sets in terms of a reduced number of extracted features. Convergence guarantees for most of the OMF algorithms in the literature assume independence between data matrices, and the case of a dependent data stream remains largely …

On the Cluster-aware Supervised Learning (CluSL): Frameworks, Convergent Algorithms, and Applications

Published: 2019/10/29, Updated: 2020/10/20

Shutong Chen

Weijun Xie

Data-Mining

alternating projection, clustering, feature extraction, globally convergent, regularization, supervised learning

This paper proposes a cluster-aware supervised learning (CluSL) framework, which integrates the clustering analysis with supervised learning (SL). The objective of CluSL is to simultaneously find the best clusters of the data points and minimize the sum of loss functions within each cluster. This framework has many potential applications in healthcare, operations management, manufacturing, and …

A sparse semismooth Newton based augmented Lagrangian method for large-scale support vector machines

Published: 2019/10/01

Convex and Nonsmooth Optimization, Data-Mining

augmented lagrangian method, semismooth newton method, support vector machines

Support vector machines (SVMs) are successful modeling and prediction tools with a variety of applications. Previous work has demonstrated the superiority of SVMs in dealing with high-dimensional, low-sample-size problems. However, the numerical difficulties of SVMs become severe as the sample size increases. Although there exist many …

Distance geometry and data science

Published: 2019/09/17

Leo Liberti

Data-Mining, Global Optimization Applications, Graphs and Matroids

artificial neural networks, euclidean distance, isometric embedding, machine learning, mathematical programming, random projection

Data are often represented as graphs. Many common tasks in data science are based on distances between entities. While some data science methodologies natively take graphs as their input, there are many more that take their input in vectorial form. In this survey we discuss the fundamental problem of mapping graphs to vectors, and its …

Optimal K-Thresholding Algorithms for Sparse Optimization Problems

Published: 2019/06/26, Updated: 2019/08/30

Yun-Bin Zhao

Constrained Nonlinear Optimization, Convex Optimization, Data-Mining

convex optimization, hard thresholding, iterative algorithms, optimal k-thresholding, restricted isometry property, sparse optimization

The simulations indicate that the existing hard thresholding technique, which is independent of the residual function, may cause a dramatic increase or numerical oscillation of the residual. This inherent drawback of hard thresholding renders the traditional thresholding algorithms unstable and thus generally inefficient for solving practical sparse optimization problems. How to overcome this weakness and develop …
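For reference, the standard hard thresholding operator keeps the k entries of a vector with the largest magnitudes and sets the rest to zero, without looking at the residual. The sketch below shows only that classical operator, not the optimal k-thresholding proposed in the paper; the function name `hard_threshold` and the tie-breaking rule (lower index wins) are assumptions made for illustration.

```python
def hard_threshold(x, k):
    """Keep the k largest-magnitude entries of x and zero out the others."""
    if k <= 0:
        return [0.0] * len(x)
    # Rank indices by decreasing magnitude; ties go to the lower index.
    ranked = sorted(range(len(x)), key=lambda i: (-abs(x[i]), i))
    keep = set(ranked[:k])
    return [x[i] if i in keep else 0.0 for i in range(len(x))]


# Illustrative input (an assumption, not data from the paper).
sparse = hard_threshold([0.5, -3.0, 2.0, 0.1], 2)
```

Because the selection depends only on the magnitudes of the entries, nothing in this operator prevents the residual of the underlying problem from growing after thresholding, which is the behaviour the abstract describes.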

An analysis of noise folding for low-rank matrix recovery

Published: 2019/04/28

Approximation Algorithms, Convex Optimization, Data-Mining

Previous work regarding low-rank matrix recovery has concentrated on the scenarios in which the matrix is noise-free and the measurements are corrupted by noise. However, in practical applications, the matrix itself is usually perturbed by random noise prior to measurement. This paper concisely investigates this scenario and shows that, for most measurement schemes utilized in …

High-Order Evaluation Complexity for Convexly-Constrained Optimization with Non-Lipschitzian Group Sparsity Terms

Published: 2019/02/27

Xiaojun Chen

Philippe L. Toint

Constrained Nonlinear Optimization, Data-Mining, Statistics

complexity, group sparsity, isotropic model, non-lipschitz functions, nonlinear optimization, partially-separable problems

This paper studies high-order evaluation complexity for partially separable convexly-constrained optimization involving non-Lipschitzian group sparsity terms in a nonconvex objective function. We propose a partially separable adaptive regularization algorithm using a $p$-th order Taylor model and show that the algorithm can produce an $(\epsilon,\delta)$-approximate $q$-th-order stationary point in at most $O(\epsilon^{-(p+1)/(p-q+1)})$ evaluations of the objective …

Approximating L1-Norm Best-Fit Lines

Published: 2019/01/09

J. Paul Brooks

JH Dulá

Data-Mining

Sufficient conditions are provided for a deterministic algorithm for estimating an L1-norm best-fit one-dimensional subspace. To prove the conditions are sufficient, fundamental properties of the L1-norm projection of a point onto a one-dimensional subspace are derived. Also, an equivalence is established between the algorithm, which involves the calculation of several weighted medians, and independently-derived algorithms …
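The link between L1-norm projection and weighted medians can be seen in a small case: for a point x and a direction v, the coefficient a that minimizes sum_i |x_i - a v_i| is a weighted median of the ratios x_i / v_i with weights |v_i|. The sketch below illustrates only that building block and is not the estimation algorithm analyzed in the paper; the function names `weighted_median` and `l1_projection_coefficient` are hypothetical.

```python
def weighted_median(values, weights):
    """Lower weighted median: the smallest value at which the cumulative
    weight reaches at least half of the total weight."""
    pairs = sorted((v, w) for v, w in zip(values, weights) if w > 0)
    total = sum(w for _, w in pairs)
    cumulative = 0.0
    for value, weight in pairs:
        cumulative += weight
        if cumulative >= total / 2:
            return value
    raise ValueError("need at least one positive weight")


def l1_projection_coefficient(x, v):
    """Coefficient a minimizing sum_i |x_i - a * v_i|.

    Coordinates with v_i == 0 contribute a constant to the objective,
    so they are skipped.
    """
    ratios = [xi / vi for xi, vi in zip(x, v) if vi != 0]
    weights = [abs(vi) for vi in v if vi != 0]
    return weighted_median(ratios, weights)


# Illustrative input (an assumption): with v = (1, 1, 1) the weighted median
# reduces to the ordinary median of the coordinates of x.
coefficient = l1_projection_coefficient([1.0, 2.0, 10.0], [1.0, 1.0, 1.0])
```

When the cumulative weight hits exactly half of the total, every value between the lower and upper weighted medians is a minimizer; this sketch returns the lower one.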