Published July 9, 2023 | Version v1
Conference paper

Black-box language model explanation by context length probing

Description

The increasingly widespread adoption of large language models has highlighted the need for improving their explainability. We present context length probing, a novel explanation technique for causal language models, based on tracking the predictions of a model as a function of the length of available context, and allowing to assign differential importance scores to different contexts. The technique is model-agnostic and does not rely on access to model internals beyond computing token-level probabilities. We apply context length probing to large pre-trained language models and offer some initial analyses and insights, including the potential for studying long-range dependencies. The source code and a demo of the method are available.

Abstract

International audience

Additional details

Identifiers

URL
https://hal.umontpellier.fr/hal-03917930
URN
urn:oai:HAL:hal-03917930v1

Origin repository

Origin repository
UNICA