El análisis de datos de expresión genética es una de las tareas fundamentales dentro de la Bioinformática. Para llevar a cabo este estudio se hace necesaria la aplicación de técnicas de Minería de Datos. Las técnicas de Clustering han probado ser de gran utilidad a la hora de descubrir grupos de genes que intervienen en una misma función...
-
February 23, 2022 (v1)PublicationUploaded on: December 5, 2022
-
December 12, 2022 (v1)Publication
In this paper, we propose a greedy clustering algorithm to identify groups of related genes and a new measure to improve the results of this algorithm. Clustering algorithms analyze genes in order to group those with similar behavior. Instead, our approach groups pairs of genes that present similar positive and/or negative interactions. In...
Uploaded on: February 22, 2023 -
April 7, 2016 (v1)Publication
Different ways of contrast generated rankings by feature selection algorithms are presented in this paper, showing several possible interpretations, depending on the given approach to each study. We begin from the premise of no existence of only one ideal subset for all cases. The purpose of these kinds of algorithms is to reduce the data set...
Uploaded on: December 4, 2022 -
May 27, 2022 (v1)Publication
The results of feature selection methods have a great influence on the success of data mining processes, especially when the data sets have high dimensionality. In order to find the optimal result from feature selection methods, we should check each possible subset of features to obtain the precision on classification, i.e., an exhaustive...
Uploaded on: March 25, 2023 -
July 20, 2022 (v1)Publication
The supervised learning methods applying evolutionary al gorithms to generate knowledge model are extremely costly in time and space. Fundamentally, this high computational cost is fundamentally due to the evaluation process that needs to go through the whole datasets to assess their goodness of the genetic individuals. Often, this process...
Uploaded on: March 25, 2023 -
May 26, 2022 (v1)Publication
In this paper we study a measure, named weakness of an example, which allows us to establish the importance of an example to find representative patterns for the data set editing problem. Our ap proach consists in reducing the database size without losing information, using algorithm patterns by ordered projections. The idea is to relax...
Uploaded on: March 25, 2023 -
July 21, 2022 (v1)Publication
The biclustering techniques have the purpose of finding subsets of genes that show similar activity patterns under a subset of conditions. In this paper we characterize a specific type of pattern, that we have called α–pattern, and present an approach that consists in a new biclustering algorithm specifically designed to find α–patterns, in...
Uploaded on: December 4, 2022 -
April 19, 2022 (v1)Publication
n this work, we propose a new greedy clustering algorithm to identify groups of related genes. Clustering algorithms analyze genes in order to group those with similar behavior. Instead, our approach groups pairs of genes that present similar positive and/or negative interactions. Our approach presents some interesting properties. For instance,...
Uploaded on: December 4, 2022 -
July 21, 2022 (v1)Publication
The great amount of biological information provides scientists with an incomparable framework for testing the results of new algorithms. Several tools have been developed for analysing gene-enrichment and most of them are Gene Ontology-based tools. We developed a Kyoto Encyclopedia of Genes and Genomes (Kegg)-based tool that provides a friendly...
Uploaded on: December 4, 2022 -
July 20, 2022 (v1)Publication
Microarray experiments help researches to construct the str ucture of gene regulatory networks, i.e., networks representing relation ships among different genes. Filter and knowledge extraction processes are necessary in order to handle the huge amount of data produced by microarray technologies. We propose regression trees techniques as a...
Uploaded on: March 25, 2023