Data-Mining – Page 9 – Optimization Online

Finding the largest low-rank clusters with Ky Fan 2-k-norm and l1-norm

Published: 2014/03/24

Convex Optimization, Data-Mining ky fan 2-k-norm, low-rank clustering, nonnegative matrix factorization

We propose a convex optimization formulation with the Ky Fan 2-k-norm and l1-norm to find k largest approximately rank-one submatrix blocks of a given nonnegative matrix that has low-rank block diagonal structure with noise. We analyze low-rank and sparsity structures of the optimal solutions using properties of these two matrix norms. We show that, under …

A Stochastic Quasi-Newton Method for Large-Scale Optimization

Published: 2014/01/29, Updated: 2015/02/17

Data-Mining, Stochastic Programming, Unconstrained Optimization limited memory bfgs, machine learning, quasi-newton methods, stochastic optimization

The question of how to incorporate curvature information in stochastic approximation methods is challenging. The direct application of classical quasi-Newton updating techniques for deterministic optimization leads to noisy curvature estimates that have harmful effects on the robustness of the iteration. In this paper, we propose a stochastic quasi-Newton method that is efficient, robust …

Alternating direction method of multipliers for sparse zero-variance discriminant analysis and principal component analysis

Published: 2014/01/22

Brendan Ames

Mingyi Hong

Data-Mining, Statistics

We consider the task of classification in the high-dimensional setting where the number of features of the given data is significantly greater than the number of observations. To accomplish this task, we propose sparse zero-variance discriminant analysis (SZVD) as a method for simultaneously performing linear discriminant analysis and feature selection on high-dimensional data. This method combines …

Uniqueness Conditions for A Class of $\ell_0$-Minimization Problems

Published: 2013/12/19, Updated: 2014/01/30

C Xu

Yun-Bin Zhao

Applications - OR and Management Sciences, Applications - Science and Engineering, Data-Mining $\ell_0$-minimization, uniqueness condition

We consider a class of $\ell_0$-minimization problems, which is to search for the partial sparsest solution to an underdetermined linear system with additional constraints. We introduce several concepts, including $l_p$-induced quasi-norm ($0

Strongly Agree or Strongly Disagree?: Rating Features in Support Vector Machines

Published: 2013/10/15, Updated: 2014/06/20

Emilio Carrizosa

Dolores Romero Morales

Amaya Nogales-Gómez

(Mixed) Integer Linear Programming, Data-Mining feature rating level, interpretability, likert scale, mixed-integer linear programming, support vector machines

In linear classifiers, such as the Support Vector Machine (SVM), a score is associated with each feature and objects are assigned to classes based on the linear combination of the scores and the values of the features. Inspired by discrete psychometric scales, which measure the extent to which a factor is in agreement with a …
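The linear classification rule described in this abstract can be sketched in a few lines of Python. This is only an illustration of the general rule (sign of a weighted sum of feature values), not the method proposed in the paper; the names `linear_score` and `classify`, the bias term, and the example weights are all hypothetical.

```python
def linear_score(weights, features, bias=0.0):
    """Linear combination of per-feature scores and feature values."""
    return sum(w * x for w, x in zip(weights, features)) + bias


def classify(weights, features, bias=0.0):
    """Assign class +1 if the score is nonnegative, otherwise class -1."""
    return 1 if linear_score(weights, features, bias) >= 0 else -1


# Hypothetical per-feature scores (assumed values, chosen for illustration).
weights = [2, -1, 0, 1]

obj_a = [1.0, 0.5, 3.0, 0.0]  # score = 2.0 - 0.5 + 0.0 + 0.0 = 1.5
obj_b = [0.0, 2.0, 1.0, 0.5]  # score = 0.0 - 2.0 + 0.0 + 0.5 = -1.5

print(classify(weights, obj_a))  # class +1
print(classify(weights, obj_b))  # class -1
```

A feature whose score is zero (the third one here) has no influence on the assigned class, whatever its value.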

Semidefinite Programming Based Preconditioning for More Robust Near-Separable Nonnegative Matrix Factorization

Published: 2013/10/08

Nicolas Gillis

Stephen A. Vavasis

Data-Mining, Semi-definite Programming nonnegative matrix factorization, preconditioning, robustness to noise, semidefinite programming, separability

Nonnegative matrix factorization (NMF) under the separability assumption can provably be solved efficiently, even in the presence of noise, and has been shown to be a powerful technique in document classification and hyperspectral unmixing. This problem is referred to as near-separable NMF and requires that there exists a cone spanned by a small subset of …

Incremental Accelerated Gradient Methods for SVM Classification: Study of the Constrained Approach

Published: 2013/05/02, Updated: 2013/08/30

Nicolas Couellan

Sophie Jan

Constrained Nonlinear Optimization, Data-Mining accelerated gradient, constrained gradient, incremental gradient method, kernel technique, machine learning, nonlinear programming, support vector machines

We investigate constrained first order techniques for training Support Vector Machines (SVM) for online classification tasks. The methods exploit the structure of the SVM training problem and combine ideas of incremental gradient technique, gradient acceleration and successive simple calculations of Lagrange multipliers. Both primal and dual formulations are studied and compared. Experiments show that the …

Robust Near-Separable Nonnegative Matrix Factorization Using Linear Optimization

Published: 2013/02/19

Nicolas Gillis

Robert Luce

Data-Mining, Linear Programming hyperspectral unmixing, linear programming, nonnegative matrix factorization, pure-pixel assumption, robustness, separability

Nonnegative matrix factorization (NMF) has been shown recently to be tractable under the separability assumption, under which all the columns of the input data matrix belong to the convex cone generated by only a few of these columns. Bittorf, Recht, Ré and Tropp ("Factoring nonnegative matrices with linear programs", NIPS 2012) proposed a linear programming …

A Continuous Characterization of the Maximum-Edge Biclique Problem

Published: 2012/12/20

Nicolas Gillis

François Glineur

Data-Mining, Meta Heuristics algorithmic complexity, biclique finding algorithm, maximum-edge biclique problem, nonnegative rank-one approximation

The problem of finding large complete subgraphs in bipartite graphs (that is, bicliques) is a well-known combinatorial optimization problem referred to as the maximum-edge biclique problem (MBP), and has many applications, e.g., in web community discovery, biological data analysis and text mining. In this paper, we present a new continuous characterization for MBP. Given a …
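To make the combinatorial problem in this abstract concrete, the sketch below finds a maximum-edge biclique in a tiny bipartite graph by exhaustive enumeration. It is not the continuous characterization the paper presents; it only illustrates the problem definition, and it is exponential in the number of rows. The function names and the example biadjacency matrix `adj` are assumptions made for illustration.

```python
from itertools import combinations


def is_biclique(adj, rows, cols):
    """True if every chosen row vertex is adjacent to every chosen column vertex."""
    return all(adj[i][j] == 1 for i in rows for j in cols)


def max_edge_biclique(adj):
    """Brute-force search; returns (edge_count, rows, cols) of a best biclique."""
    n_rows = len(adj)
    n_cols = len(adj[0]) if adj else 0
    best = (0, (), ())
    for r in range(1, n_rows + 1):
        for rows in combinations(range(n_rows), r):
            # Columns adjacent to all chosen rows: the largest valid column set.
            cols = tuple(
                j for j in range(n_cols) if all(adj[i][j] == 1 for i in rows)
            )
            edges = len(rows) * len(cols)
            if edges > best[0]:
                best = (edges, rows, cols)
    return best


# Hypothetical 0/1 biadjacency matrix: adj[i][j] == 1 means row vertex i
# is connected to column vertex j.
adj = [
    [1, 1, 0, 1],
    [1, 1, 0, 0],
    [1, 1, 1, 0],
    [0, 0, 1, 1],
]

print(max_edge_biclique(adj))  # (6, (0, 1, 2), (0, 1))
```

Here rows 0, 1, 2 together with columns 0, 1 form a complete bipartite subgraph with 3 × 2 = 6 edges, which is the largest edge count in this example.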