Automatic definition of the level of textual difficulty of documents

Tounsi Dhouib, Molka; Kostrykina, Ekaterina; Faron, Catherine

Published January 22, 2024 | Version v1

Conference paper | Metadata-only

Contributors

Other:

Web-Instrumented Man-Machine Interactions, Communities and Semantics (WIMMICS) ; Inria Sophia Antipolis - Méditerranée (CRISAM) ; Institut National de Recherche en Informatique et en Automatique (Inria) ; Scalable and Pervasive softwARe and Knowledge Systems (Laboratoire I3S - SPARKS) ; Laboratoire d'Informatique, Signaux, et Systèmes de Sophia Antipolis (I3S) ; Université Nice Sophia Antipolis (1965 - 2019) (UNS) ; Centre National de la Recherche Scientifique (CNRS) ; Université Côte d'Azur (UCA)

For many educational applications, such as learning resource recommendation systems, the difficulty of a text is key information. Today, the best results for this task are obtained with NLP and deep learning techniques. However, these methods can lose statistical linguistic information that is important for determining text readability more accurately. In our work, we propose an approach for assessing text readability that combines neural network models with linguistic features extracted from the text and integrated into the model. Experimental results show that this combination improves system performance.
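As a rough illustration of combining a neural representation with linguistic features, the sketch below computes three common surface readability features and appends them to a text embedding. The record does not say which features the authors extract or how they are integrated into the model, so the feature set, the concatenation step, and the names `linguistic_features` and `combine` are all assumptions made for this example. The embedding is a placeholder standing in for the output of any neural text encoder.

```python
import re
from typing import List


def linguistic_features(text: str) -> List[float]:
    """Return three illustrative surface features (an assumed feature set):
    average sentence length in words, average word length in characters,
    and type-token ratio."""
    # Naive sentence split on terminal punctuation; empty pieces are dropped.
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    # Runs of Unicode letters, lowercased, so accented words are kept whole.
    words = re.findall(r"[^\W\d_]+", text.lower())
    if not sentences or not words:
        return [0.0, 0.0, 0.0]
    avg_sentence_len = len(words) / len(sentences)
    avg_word_len = sum(len(w) for w in words) / len(words)
    type_token_ratio = len(set(words)) / len(words)
    return [avg_sentence_len, avg_word_len, type_token_ratio]


def combine(embedding: List[float], features: List[float]) -> List[float]:
    """Concatenate a neural embedding with hand-crafted features.
    Concatenation is one possible fusion strategy, assumed here."""
    return list(embedding) + list(features)


# Placeholder embedding; a real system would obtain it from a neural encoder.
placeholder_embedding = [0.12, -0.40, 0.88]
vector = combine(placeholder_embedding, linguistic_features("The cat sat. The dog ran."))
```

The combined vector could then be passed to a classifier that predicts a difficulty level. In practice the features would usually be normalized before being joined with the embedding.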

Abstract (translated from French)

Readability, or textual difficulty, is important information for many educational applications such as recommendation systems. Today, the best results for this task are obtained using natural language processing (NLP) and deep learning techniques. In this work, we propose a multilingual approach that combines neural network models with linguistic features extracted from the text in order to improve the quality of the assessment. We tested our approach on two benchmarks, and the results show that this combination improves system performance.

Abstract

National audience

Additional details

URL: https://hal.science/hal-04353060
URN: urn:oai:HAL:hal-04353060v1
