Published July 15, 2010
| Version v1
Conference paper
Single Document Keyphrase Extraction Using Sentence Clustering and Latent Dirichlet Allocation
- Creators
- Pasquier, Claude
Description
This paper describes the design of a system for extracting keyphrases from a single document The principle of the algorithm is to cluster sentences of the documents in order to highlight parts of text that are semantically related. The clusters of sentences, that reflect the themes of the document, are then analyzed to find the main topics of the text. Finally, the most important words, or groups of words, from these topics are proposed as keyphrases.
Abstract
International audience
Additional details
- URL
- https://hal.archives-ouvertes.fr/hal-01151516
- URN
- urn:oai:HAL:hal-01151516v1
- Origin repository
- UNICA