Published December 2006 | Version v1
Conference paper
Report on the XML Mining Track at INEX 2005 and INEX 2006, Categorization and Clustering of XML Documents
Contributors
Others:
- Machine Learning and Information Retrieval (MALIRE) ; Laboratoire d'Informatique de Paris 6 (LIP6) ; Université Pierre et Marie Curie - Paris 6 (UPMC)-Centre National de la Recherche Scientifique (CNRS)
- Usage-centered design, analysis and improvement of information systems (AxIS) ; Centre Inria d'Université Côte d'Azur (CRISAM) ; Institut National de Recherche en Informatique et en Automatique (Inria)-Inria Paris-Rocquencourt ; Institut National de Recherche en Informatique et en Automatique (Inria)
- Fuhr, N.
- Lalmas, M.
- Malik, S.
- Kazai, G.
Description
This article reports on the two years of the XML Mining track at INEX (2005 and 2006). We focus on the classification and clustering of XML documents. We describe these two tasks and the corpus used for the challenge, then summarize the methods proposed by the participants. Finally, we compare the results obtained during the two years of the track.
Abstract
International audience
Additional details
Identifiers
- URL
- https://inria.hal.science/inria-00173420
- URN
- urn:oai:HAL:inria-00173420v1
Origin repository
- Origin repository
- UNICA