Linguistic uses of supplementary elements in Correspondence Analysis (original title: Usages linguistiques des éléments supplémentaires dans l'Analyse factorielle des correspondances)
- Creators
- Mayaffre, Damon
- Vanni, Laurent
- Others:
- BCL, équipe Logométrie : corpus, traitements, modèles ; Bases, Corpus, Langage (UMR 7320 - UCA / CNRS) (BCL) ; Université Nice Sophia Antipolis (1965 - 2019) (UNS)-Centre National de la Recherche Scientifique (CNRS)-Université Côte d'Azur (UniCA)
- Anne Dister
- Dominique Longrée
Description
This contribution demonstrates the value of supplementary variables in Correspondence Analysis (CA). Starting from a CA vector space that crosses the main morpho-syntactic categories with the French Presidents of the Fifth Republic, we project the lemma "indiquer" (to indicate) as a supplementary variable. Will it fall within the "verb subspace" of the graph? And if not, what should we conclude from this counterintuitive position? Beyond this example, we question the linguistic homogeneity of the rows of the contingency table (words, lemmas, grammatical categories, etc.) by projecting, in more or less interpretable ways, other linguistic elements as supplementary elements. The contribution ends with an open discussion on the complementarity of CA and deep neural networks.
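The projection of a supplementary row described above can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names `ca_fit` and `project_supplementary_row` are hypothetical, and the counts are invented toy values standing in for a table of grammatical categories (rows) by speakers (columns). The sketch assumes the standard CA transition formula, in which a supplementary row profile is multiplied by the standard coordinates of the columns.

```python
import numpy as np

def ca_fit(table):
    """Plain CA of a contingency table (hypothetical helper, illustrative only).

    Returns the principal coordinates of the active rows and the
    standard coordinates of the columns.
    """
    N = np.asarray(table, dtype=float)
    P = N / N.sum()                      # correspondence matrix
    r = P.sum(axis=1)                    # row masses
    c = P.sum(axis=0)                    # column masses
    # Standardized residuals: D_r^{-1/2} (P - r c^T) D_c^{-1/2}
    S = (P - np.outer(r, c)) / np.sqrt(np.outer(r, c))
    U, sv, Vt = np.linalg.svd(S, full_matrices=False)
    k = min(N.shape) - 1                 # number of non-trivial axes
    U, sv, Vt = U[:, :k], sv[:k], Vt[:k]
    row_principal = (U * sv) / np.sqrt(r)[:, None]
    col_standard = Vt.T / np.sqrt(c)[:, None]
    return row_principal, col_standard

def project_supplementary_row(counts, col_standard):
    """Place a supplementary row in the space built from the active rows.

    The row does not influence the axes: its profile is simply
    multiplied by the column standard coordinates.
    """
    counts = np.asarray(counts, dtype=float)
    profile = counts / counts.sum()
    return profile @ col_standard

# Invented toy counts: 4 active rows (e.g. categories) x 3 columns (e.g. speakers).
active = [[30, 10, 5],
          [12, 25, 8],
          [6, 9, 20],
          [15, 15, 15]]
rows, cols_std = ca_fit(active)

# Invented counts for one supplementary element (e.g. a single lemma).
supplementary = [3, 1, 1]
print(project_supplementary_row(supplementary, cols_std))
```

Comparing the supplementary point with the coordinates in `rows` is what shows whether it falls near the active row one would expect, for instance the verb category for a verbal lemma. Projecting an active row through `project_supplementary_row` returns that row's own principal coordinates, which gives a quick consistency check.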
Abstract
International audience
Additional details
- URL
- https://cnrs.hal.science/hal-04647313
- URN
- urn:oai:HAL:hal-04647313v1
- Origin repository
- UNICA