Extraction d'entités dans des collections évolutives

Despeyroux, Thierry; Fraschini, Eduardo; Vercoustre, Anne-Marie

Published January 23, 2007 | Version v1

Conference paper Metadata-only

Extraction d'entités dans des collections évolutives

Contributors

Others:

Usage-centered design, analysis and improvement of information systems (AxIS) ; Centre Inria d'Université Côte d'Azur (CRISAM) ; Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Inria Paris-Rocquencourt ; Institut National de Recherche en Informatique et en Automatique (Inria)
M. Noirhomme-Fraiture and G. Venturini

The goal of our work is to use a set of reports and extract named entities, in our case the names of Industrial or Academic partners. Starting with an initial list of entities, we use a first set of documents to identify syntactic patterns that are then validated in a supervised learning phase on a set of annotated documents. The complete collection is then explored. This approach is similar to the ones used in data extraction from semi-structured documents (wrappers) and do not need any linguistic resources neither a large set for training. As our collection of documents would evolve over years , we hope that the performance of the extraction would improve with the increased size of the training set.

Abstract

The bibteX file has been replaced with the correct one.

Abstract

International audience

Additional details

URL: https://inria.hal.science/inria-00116910
URN: urn:oai:HAL:inria-00116910v4

Origin repository: UNICA

	All versions	This version
Views	5	5
Downloads	0	0
Data volume	0 Bytes	0 Bytes

Extraction d'entités dans des collections évolutives

Creators

Contributors

Others:

Description

Abstract

Abstract

Additional details

Identifiers

Origin repository