Data-Mining – Page 16 – Optimization Online

A Tabu Search Algorithm for Partitioning

Published: 2004/12/03

Data-Mining, Meta Heuristics, Statistics clustering, data mining, metaheuristics, tabu search

We present an original method for partitioning by automatic classification, using the optimization technique of tabu search. The method uses a classical tabu search scheme based on transfers for the minimization of the within variance; it introduces in the tabu list the indicator of the object transferred. This method is compared with two stochastic … Read more
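
The transfer-based scheme described in this abstract can be illustrated roughly as follows. This is a minimal sketch and not the authors' implementation: the function names, the tabu tenure, the aspiration rule, and the fixed iteration count are all assumptions made for illustration. Each move transfers one object to another class, the best admissible move is taken even if it worsens the cost, and the index of the transferred object is stored in the tabu list.

```python
from collections import deque


def within_variance(points, labels, k):
    """Sum of squared distances of each point to its class centroid."""
    total = 0.0
    for c in range(k):
        members = [p for p, lab in zip(points, labels) if lab == c]
        if not members:
            continue  # an empty class contributes nothing
        dim = len(members[0])
        centroid = [sum(p[j] for p in members) / len(members) for j in range(dim)]
        total += sum((p[j] - centroid[j]) ** 2 for p in members for j in range(dim))
    return total


def tabu_partition(points, k, labels, tenure=2, iterations=50):
    """Hypothetical helper: tabu search over single-object transfers."""
    labels = list(labels)
    best_labels = list(labels)
    best_cost = within_variance(points, labels, k)
    tabu = deque(maxlen=tenure)  # indices of recently transferred objects
    for _ in range(iterations):
        candidate = None
        for i in range(len(points)):
            for c in range(k):
                if c == labels[i]:
                    continue
                trial = labels[:i] + [c] + labels[i + 1:]
                cost = within_variance(points, trial, k)
                # Assumed aspiration rule: a tabu move is allowed
                # only if it improves on the best cost found so far.
                allowed = i not in tabu or cost < best_cost
                if allowed and (candidate is None or cost < candidate[0]):
                    candidate = (cost, i, c)
        if candidate is None:
            break
        cost, i, c = candidate
        labels[i] = c      # take the best admissible move, even uphill
        tabu.append(i)     # the transferred object becomes tabu
        if cost < best_cost:
            best_cost, best_labels = cost, list(labels)
    return best_labels, best_cost


points = [(0.0,), (1.0,), (10.0,), (11.0,)]
best_labels, best_cost = tabu_partition(points, k=2, labels=[0, 1, 0, 1])
```

On this toy data the search starts from a deliberately poor partition and should end with the two well-separated pairs in different classes.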

A Mixed-Integer Programming Approach to Multi-Class Data Classification Problem

Published: 2004/11/11

Metin Turkay

Fadime Uney

(Mixed) Integer Linear Programming, Data-Mining boolean algebra, data classification, data mining, mixed-integer programming

This paper presents a new data classification method based on mixed-integer programming. Traditional approaches that are based on partitioning the data sets into two groups perform poorly for multi-class data classification problems. The proposed approach is based on the use of hyper-boxes for defining boundaries of the classes that include all or some of the … Read more

Optimal distance separating halfspace

Published: 2004/10/05

Emilio Carrizosa

Frank Plastria

Data-Mining, Global Optimization Applications discriminant analysis, norm-distance to hyperplane, separating halfspace

One recently proposed criterion to separate two datasets in discriminant analysis is to use a hyperplane which minimises the sum of distances to it from all the misclassified data points. Here all distances are supposed to be measured by way of some fixed norm, while misclassification means lying on the wrong side of the hyperplane, or … Read more
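
To make the criterion concrete, the sketch below evaluates it for a given hyperplane; it does not perform the optimization studied in the paper. It relies on the standard fact that the distance from a point x to the hyperplane {z : w·z = c}, measured in a norm, equals |w·x − c| divided by the dual norm of w. The function names, the restriction to ℓp norms, and the convention that the first set belongs on the side w·x ≤ c are assumptions made for illustration.

```python
import math


def dual_norm(w, p):
    """Dual of the l_p norm evaluated at w (l_q with 1/p + 1/q = 1)."""
    if p == 1:
        return max(abs(wi) for wi in w)
    if p == math.inf:
        return sum(abs(wi) for wi in w)
    q = p / (p - 1)
    return sum(abs(wi) ** q for wi in w) ** (1 / q)


def misclassified_distance_sum(points_a, points_b, w, c, p=2):
    """Sum of l_p distances to the hyperplane w.x = c over misclassified points.

    Assumed convention: points_a should satisfy w.x <= c and
    points_b should satisfy w.x >= c.
    """
    def dot(u, v):
        return sum(ui * vi for ui, vi in zip(u, v))

    violation = 0.0
    for x in points_a:
        violation += max(0.0, dot(w, x) - c)  # positive only on the wrong side
    for x in points_b:
        violation += max(0.0, c - dot(w, x))
    return violation / dual_norm(w, p)


set_a = [(-1.0, 0.0), (2.0, 0.0)]   # (2, 0) lies on the wrong side
set_b = [(1.0, 1.0), (-3.0, 0.0)]   # (-3, 0) lies on the wrong side
total = misclassified_distance_sum(set_a, set_b, w=(1.0, 0.0), c=0.0, p=2)
```

Correctly classified points contribute zero, so only the two points on the wrong side of the hyperplane enter the sum.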

Optimal expected-distance separating halfspace

Published: 2004/10/05

Emilio Carrizosa

Frank Plastria

Data-Mining, Global Optimization Applications, Statistics discriminant analysis, norm-distance to hyperplane, separating halfspace

One recently proposed criterion to separate two datasets in discriminant analysis is to use a hyperplane which minimises the sum of distances to it from all the misclassified data points. Here all distances are supposed to be measured by way of some fixed norm, while misclassification means lying on the wrong side of the hyperplane, … Read more

A New Computational Approach to Density Estimation with Semidefinite Programming

Published: 2003/11/29, Updated: 2003/12/19

Tadayoshi Fushiki

Takashi Tsuchiya

Shingo Horiuchi

Data-Mining, Semi-definite Programming, Statistics aic, density estimation, maximum likelihood estimation, semidefinite programming, statistics

Density estimation is a classical and important problem in statistics. The aim of this paper is to develop a new computational approach to density estimation based on semidefinite programming (SDP), a new technology developed in optimization in the last decade. We express a density as the product of a nonnegative polynomial and a base density … Read more

Gradient Projection Methods for Quadratic Programs and Applications in Training Support Vector Machines

Published: 2003/07/30, Updated: 2007/11/16

Thomas Serafini

Luca Zanni

Gaetano Zanghirati

Data-Mining, Quadratic Programming decomposition, gradient projection methods, large-scale optimization, quadratic programming, support vector machines

Gradient projection methods based on the Barzilai-Borwein spectral steplength choices are considered for quadratic programming problems with simple constraints. Well known nonmonotone spectral projected gradient methods and variable projection methods are discussed. For both approaches the behavior of different combinations of the two spectral steplengths is investigated. A new adaptive steplength alternating rule is proposed, … Read more
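
As a rough illustration of the basic ingredients named in this abstract, the sketch below applies a projected gradient iteration with one Barzilai-Borwein steplength to a small box-constrained quadratic program. It is a simplified example under stated assumptions: it uses only the steplength sᵀs / sᵀy, omits the nonmonotone line search and the alternating rule the abstract refers to, and all names and the test problem are hypothetical.

```python
def project_box(x, lower, upper):
    """Project x onto the box lower <= x <= upper, coordinate by coordinate."""
    return [min(max(xi, lo), hi) for xi, lo, hi in zip(x, lower, upper)]


def matvec(A, x):
    return [sum(a * xj for a, xj in zip(row, x)) for row in A]


def dot(u, v):
    return sum(ui * vi for ui, vi in zip(u, v))


def bb_gradient_projection(A, b, lower, upper, x0, max_iter=200, tol=1e-10):
    """Minimize 0.5*x'Ax - b'x over a box with a BB-scaled projected gradient step."""
    x = project_box(x0, lower, upper)
    grad = [ax - bi for ax, bi in zip(matvec(A, x), b)]
    alpha = 1.0  # assumed initial steplength
    for _ in range(max_iter):
        x_new = project_box([xi - alpha * gi for xi, gi in zip(x, grad)], lower, upper)
        s = [xn - xi for xn, xi in zip(x_new, x)]
        if dot(s, s) ** 0.5 < tol:
            return x_new  # the projected step no longer moves the iterate
        grad_new = [ax - bi for ax, bi in zip(matvec(A, x_new), b)]
        y = [gn - gi for gn, gi in zip(grad_new, grad)]
        sy = dot(s, y)
        # Barzilai-Borwein steplength s's / s'y, with a fallback if s'y <= 0.
        alpha = dot(s, s) / sy if sy > 0 else 1.0
        x, grad = x_new, grad_new
    return x


A = [[2.0, 0.0], [0.0, 1.0]]
b = [2.0, 3.0]
solution = bb_gradient_projection(A, b, lower=[0.0, 0.0], upper=[2.0, 2.0], x0=[0.0, 0.0])
```

For this separable problem the unconstrained minimizer is (1, 3), so the constrained solution is (1, 2), with the second coordinate at its upper bound.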

The Maximum Box Problem and its Application to Data Analysis

Published: 2002/01/16

Combinatorial Optimization, Data-Mining Branch-and-Bound, data analysis, maximum box

Given two finite sets of points $X^+$ and $X^-$ in $\mathbb{R}^n$, the maximum box problem consists in finding an interval (“box”) $B=\{x : l \leq x \leq u\}$ such that $B\cap X^-=\emptyset$, and the cardinality of $B\cap X^+$ is maximized. A simple generalization can be obtained by instead maximizing a weighted sum of the elements … Read more
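
The definition above can be checked on tiny instances by exhaustive enumeration. The sketch below is only an illustration of the problem statement, not the branch-and-bound method of the paper; the function names are hypothetical. It uses the observation that some optimal box can be shrunk to the bounding box of the positive points it contains, so it suffices to try bounds taken from coordinates of points in $X^+$. The running time grows exponentially with the dimension.

```python
from itertools import product


def in_box(x, lower, upper):
    """True if lower <= x <= upper holds in every coordinate."""
    return all(lo <= xi <= hi for lo, xi, hi in zip(lower, x, upper))


def max_box_brute_force(positives, negatives):
    """Return (count, lower, upper) for a box with no negatives and most positives."""
    dim = len(positives[0])
    # Candidate bounds per coordinate: the coordinates of the positive points.
    coords = [sorted({p[j] for p in positives}) for j in range(dim)]
    intervals = [[(lo, hi) for lo in c for hi in c if lo <= hi] for c in coords]
    best = (0, None, None)
    for choice in product(*intervals):
        lower = tuple(lo for lo, _ in choice)
        upper = tuple(hi for _, hi in choice)
        if any(in_box(x, lower, upper) for x in negatives):
            continue  # the box must not contain any point of X^-
        count = sum(in_box(x, lower, upper) for x in positives)
        if count > best[0]:
            best = (count, lower, upper)
    return best


positives = [(0, 0), (1, 1), (2, 2), (5, 5)]
negatives = [(3, 3)]
count, lower, upper = max_box_brute_force(positives, negatives)
```

In this example any box containing (5, 5) together with another positive point would also contain the negative point (3, 3), so at most three positive points can be covered.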

Semismooth Support Vector Machines

Published: 2000/11/30

Michael C. Ferris

Todd S. Munson

Complementarity and Variational Inequalities, Data-Mining convergence analysis, data mining, semismooth method

The linear support vector machine can be posed as a quadratic program in a variety of ways. In this paper, we look at a formulation using the two-norm for the misclassification error that leads to a positive definite quadratic program with a single equality constraint when the Wolfe dual is taken. The quadratic term is … Read more

Interior point methods for massive support vector machines

Published: 2000/08/22, Updated: 2006/04/04

Michael C. Ferris

Todd S. Munson

Data-Mining interior point methods, linear algebra, support-vector-machines

We investigate the use of interior point methods for solving quadratic programming problems with a small number of linear constraints where the quadratic term consists of a low-rank update to a positive semi-definite matrix. Several formulations of the support vector machine fit into this category. An interesting feature of these particular problems is the volume … Read more