Intellectual Property Invention ID2326WW00 registered by Intellectual Property Board, Amadeus S.A.S., Sophia Antipolis, France. Amadeus IP Invention licensed by Defensive Publications in the CIKM 2020 and DATA 2020 International Conferences: [1] Tianshu Yang, Nicolas Pasquier, Antoine Hom, Laurent Dolle, Frédéric Precioso. "Semi-supervised...
-
July 7, 2020 (v1)PatentUploaded on: December 4, 2022
-
November 2, 2007 (v1)Conference paper
GENMINER is a smart adaptation of closed itemsets based association rules extraction to genomic data. It takes advantage of the novel NORDI discretization method and of the JCLOSE algorithm to efficiently generate minimal non-redundant association rules. GENMINER facilitates the integration of numerous sources of biological information such as...
Uploaded on: December 3, 2022 -
October 3, 2008 (v1)Conference paper
During the last decade, several clustering and association rule mining techniques have been applied to identify groups of co-regulated genes in gene expression data. Nowadays, integrating biological knowledge and gene expression data into a single framework has become a major challenge to improve the relevance of mined patterns and simplify...
Uploaded on: December 3, 2022 -
September 15, 2014 (v1)Conference paper
Biology has become an enormously data-rich subject. Data is generated in many flavors and follows particularities of the omics perspective adopted along experimental studies. For instance, genomics is the field of study dealing with genomes and it is mostly associated with the static view (the genes and where they are placed along the genome)....
Uploaded on: March 1, 2023 -
April 1, 2009 (v1)Journal article
During the last decade, several clustering and association rule mining techniques have been applied to highlight groups of co-regulated genes in gene expression data. Nowadays, integrating these data and biological knowledge into a single framework has become a ma- jor challenge to improve the relevance of mined patterns and simplify their...
Uploaded on: December 3, 2022 -
October 15, 2009 (v1)Conference paper
During the last decade, several clustering and association rule mining techniques have been applied to highlight groups of co-regulated genes in gene expression data. Nowadays, integrating these data and biological knowledge into a single framework has become a ma- jor challenge to improve the relevance of mined patterns and simplify their...
Uploaded on: March 26, 2023 -
November 2, 2007 (v1)Conference paper
GENMINER is a smart adaptation of closed itemsets based association rules extraction to genomic data. It takes advantage of the novel NORDI discretization method and of the CLOSE algorithm to efficiently generate minimal non-redundant association rules. GENMINER facilitates the integration of numerous sources of biological information such as...
Uploaded on: March 26, 2023 -
November 15, 2008 (v1)Journal article
GenMiner is an implementation of association rule discovery dedicated to the analysis of genomic data. It allows the analysis of datasets integrating multiple sources of biological data represented as both discrete values, such as gene annotations, and continuous values, such as gene expression measures. GenMiner implements the new NorDi...
Uploaded on: December 3, 2022 -
October 3, 2005 (v1)Conference paper
Using several analyse techniques for the hierarchical clustering of a SAGE expression dataset of 822 tags from 74 tissue samples (normal and cancer) we show that cleaning the dataset (tags and experiments) is critical and that attribution of a tag to a gene is not easy. Comparison of cancers from various tissues is a difficult task as tissue...
Uploaded on: March 26, 2023 -
December 18, 2008 (v1)Journal article
Biology is now an information-intensive science and various research areas, like molecular biology, evolutionary biology or environmental biology, heavily depend on the availability and the efficient use of information. Data mining, that regroups several techniques for analyzing very large datasets, is used to solve problems in an increasing...
Uploaded on: December 3, 2022 -
May 30, 2006 (v1)Conference paper
La technologie des biopuces permet de mesurer les niveaux d'expression de milliers de gènes dans différentes conditions biologiques générant ainsi des masses de données à analyser. De nos jours, l'interprétation de ces volumineux jeux de donnés à la lumière des différentes sources d'informations est l'un des principaux défis dans la...
Uploaded on: December 3, 2022 -
October 3, 2005 (v1)Conference paper
Using several analyse techniques for the hierarchical clustering of a SAGE expression dataset of 822 tags from 74 tissue samples (normal and cancer) we show that cleaning the dataset (tags and experiments) is critical and that attribution of a tag to a gene is not easy. Comparison of cancers from various tissues is a difficult task as tissue...
Uploaded on: December 3, 2022 -
October 11, 2006 (v1)Conference paper
Microarray technology produces vast amounts of data by measuring simultaneously the expression levels of thousands of genes under hundreds of biological conditions. Nowadays, one of the principal challenges in bioinformatics is the interpretation of huge data using different sources of information. We propose a novel data analysis method named...
Uploaded on: December 3, 2022 -
May 31, 2009 (v1)Book section
After more than one decade of researches on association rule mining, efficient and scalable techniques for the discovery of relevant association rules from large high-dimensional datasets are now available. Most initial studies have focused on the development of theoretical frameworks and efficient algorithms and data structures for association...
Uploaded on: December 3, 2022 -
April 29, 2005 (v1)Book section
In the domain of knowledge discovery in databases and its computational part called data mining, many works addressed the problem of association rule extraction that aims at discovering relationships between sets of items (binary attributes). An example association rule fitting in the context of market basket data analysis is cereal ∧ milk →...
Uploaded on: December 3, 2022 -
August 28, 2006 (v1)Journal article
Microarray technology produces vast amounts of data by measuring simultaneously the expression levels of thousands of genes under hundreds of biological conditions. Nowadays, one of the principal challenges in bioinformatics is the interpretation of this large amount of data using different sources of information. We have developed a novel data...
Uploaded on: February 28, 2023 -
April 1, 2008 (v1)Journal article
La technologie des biopuces permet de mesurer les niveaux d'expression de milliers de gènes dans différentes conditions biologiques générant ainsi des masses de données à analyser. De nos jours, l'interprétation de ces volumineux jeux de donnés à la lumière des différentes sources d'informations est l'un des principaux défis dans la...
Uploaded on: December 3, 2022 -
September 6, 2006 (v1)Conference paper
La technologie des biopuces permet de mesurer les niveaux d'expression de milliers de gènes dans différentes conditions biologiques générant ainsi des masses de données à analyser. De nos jours, l'interprétation de ces volumineux jeux de donnés à la lumière des différentes sources d'informations est l'un des principaux défis dans la...
Uploaded on: March 25, 2023 -
January 1, 2008 (v1)Journal article
La technologie des biopuces permet de mesurer les niveaux d'expression de milliers de gènes dans différentes conditions biologiques générant ainsi des masses de données à analyser. De nos jours, l'interprétation de ces volumineux jeux de données à la lumière des différentes sources d'information est l'un des principaux défis dans la...
Uploaded on: January 26, 2024 -
August 19, 2020 (v1)Conference paper
In the travel industry context, customer segmentation, that is the clustering of travelers to distinguish segments of customers with similar needs and desires, is a major issue for improving the personalization of recommendations in flight search queries. Indeed, when booking travel itineraries, different customers purchase tickets according to...
Uploaded on: December 4, 2022 -
April 1, 2021 (v1)Journal article
Customer Choice Modeling aims to model the decision-making process of customers, or segments of customers, through their choices and preferences identified by the analysis of their behaviors in one or more specific contexts. Clustering techniques are used in this context to identify patterns in their choices and preferences, to define segments...
Uploaded on: December 4, 2022 -
January 6, 2014 (v1)Book section
International audience
Uploaded on: February 28, 2023 -
July 7, 2020 (v1)Conference paper
We present a semi-supervised ensemble clustering framework for identifying relevant multi-level clusters, regarding application objectives, in large datasets and mapping them to application classes for predicting the class of new instances. This framework extends the MultiCons closed sets based multiple consensus clustering approach but can...
Uploaded on: December 4, 2022 -
April 27, 2020 (v1)Publication
International audience
Uploaded on: December 4, 2022 -
January 17, 2006 (v1)Conference paper
Le Système d'Information Conceptuel ExCIS pour l'extraction de connaissances est une approche s'inspirant de CRISP- DM et intégrant le support des ontologies. Il permet de définir une ontologie de représentation des connaissances expertes du domaine, prenant en compte les besoins de la fouille de donnée, afin d'améliorer la pertinence des...
Uploaded on: December 3, 2022