Data Mining techniques often ask for the resolution of optimization problems. Supervised Classification, and, in particular, Support Vector Machines, can be seen as a paradigmatic instance. In this paper, some links between Mathematical Optimization methods and Supervised Classification are emphasized. It is shown that many different areas of...
-
September 8, 2016 (v1)PublicationUploaded on: March 27, 2023
-
September 8, 2016 (v1)Publication
The two main and contradicting criteria guiding sampling design are accuracy of estimators and sampling costs. In stratified random sampling, the sample size must be allocated to strata in order to optimize both objectives. In this note we address, following a biobjective methodology, this allocation problem. A two-phase method is proposed to...
Uploaded on: December 4, 2022 -
April 23, 2021 (v1)Publication
A number of methods for multiple-objective optimization problems (MOP) give as solution to MOP the set of optimal solutions for some single-objective optimization problems associated with it. Well-known examples of these single-objective optimization problems are the minsum and the minmax. In this note, we propose a new parametric...
Uploaded on: December 5, 2022 -
April 20, 2021 (v1)Publication
The two main and contradicting criteria guiding sampling design are accuracy of estimators and sampling costs. In stratified random sampling, the sample size must be allocated to strata in order to optimize both objectives. In this note we address, following a biobjective methodology, this allocation problem. A two-phase method is proposed to...
Uploaded on: December 4, 2022 -
September 8, 2016 (v1)Publication
In the last decades there has been an increasing interest in environmental topics. This interest has been reflected in modeling the location of obnoxious facilities, as shown by the important number of papers published in this field. However, a very common drawback of the existing literature is that, as soon as environmental aspects are taken...
Uploaded on: December 5, 2022 -
December 9, 2022 (v1)Publication
Classification and regression trees, as well as their variants, are off-the-shelf methods in Machine Learning. In this paper, we review recent contributions within the Continuous Optimization and the Mixed-Integer Linear Optimization paradigms to develop novel formulations in this research area. We compare those in terms of the nature of the...
Uploaded on: March 24, 2023 -
July 4, 2022 (v1)Publication
Classifcation and regression trees, as well as their variants, are of-the-shelf meth ods in Machine Learning. In this paper, we review recent contributions within the Continuous Optimization and the Mixed-Integer Linear Optimization paradigms to develop novel formulations in this research area. We compare those in terms of the nature of the...
Uploaded on: March 24, 2023 -
September 8, 2016 (v1)Publication
Recently, Support Vector Machines with the ramp loss (RLM) have attracted attention from the computational point of view. In this technical note, we propose two heuristics, the first one based on solving the continuous relaxation of a Mixed Integer Nonlinear formulation of the RLM and the second one based on the training of an SVM classifier on...
Uploaded on: December 4, 2022 -
September 8, 2016 (v1)Publication
The default approach for tuning the parameters of a Support Vector Machine (SVM) is a grid search in the parameter space. Different metaheuristics have been recently proposed as a more efficient alternative, but they have only shown to be useful in models with a low number of parameters. Complex models, involving many parameters, can be seen...
Uploaded on: December 5, 2022 -
September 8, 2016 (v1)Publication
The widely used Support Vector Machine (SVM) method has shown to yield very good results in Supervised Classification problems. Other methods such as Classification Trees have become more popular among practitioners than SVM thanks to their interpretability, which is an important issue in Data Mining. In this work, we propose an SVM-based...
Uploaded on: December 4, 2022 -
December 9, 2016 (v1)Publication
In this paper we address the problem of visualizing a set of individuals, which have attached a statistical value given as a proportion, and a dissimilarity measure. Each individual is represented as a region within the unit square, in such a way that the area of the regions represent the proportions and the distances between them represent the...
Uploaded on: December 2, 2022 -
June 27, 2016 (v1)Publication
The support vector machine (SVM) is a state-of-the-art method in supervised classification. In this paper the Cluster Support Vector Machine (CLSVM) methodology is proposed with the aim to increase the sparsity of the SVM classifier in the presence of categorical features, leading to a gain in interpretability. The CLSVM methodology clusters...
Uploaded on: March 27, 2023 -
April 26, 2021 (v1)Publication
In this paper we propose an optimization model and a solution approach to visualize datasets which are made up of individuals observed along different time periods. These individuals have attached a time-dependent magnitude and a dissimilarity measure, which may vary over time. Difference of convex optimization techniques, namely, the so-called...
Uploaded on: March 25, 2023 -
September 8, 2016 (v1)Publication
The widely used Support Vector Machine (SVM) method has shown to yield good results in Supervised Classification problems. When the interpretability is an important issue, then classification methods such as Classification Trees (CART) might be more attractive, since they are designed to detect the important predictor variables and, for each...
Uploaded on: March 27, 2023 -
June 27, 2016 (v1)Publication
In linear classifiers, such as the Support Vector Machine (SVM), a score is associated with each feature and objects are assigned to classes based on the linear combination of the scores and the values of the features. Inspired by discrete psychometric scales, which measure the extent to which a factor is in agreement with a statement, we...
Uploaded on: March 27, 2023 -
May 2, 2017 (v1)Publication
In this paper we address the problem of visualizing in a bounded region a set of individuals, which has attached a dissimilarity measure and a statistical value, as convex objects. This problem, which extends the standard Multidimensional Scaling Analysis, is written as a global optimization problem whose objective is the difference of two...
Uploaded on: March 27, 2023