Categories
Nevin Manimala Statistics

Lexique 4: A major upgrade of the Lexique French lexical database

Behav Res Methods. 2026 Apr 24;58(5):140. doi: 10.3758/s13428-026-02967-5.

ABSTRACT

Lexique 4, an updated French lexical database, expands upon its predecessor, Lexique 3, by incorporating several significant improvements to enhance its utility in psycholinguistics, computational linguistics, and education. The new version is based on a larger corpus of 316 million words derived from 65,317 documents, including movie, TV show, and documentary subtitles, which offers more accurate frequency estimates and includes contemporary neologisms. Lexique 4 introduces new variables, such as orthographic surface frequency, contextual diversity (CD), and detailed morphological structure, which provide a more comprehensive view of lexical properties. We find that contextual diversity is a slightly better predictor than word frequency, in line with previous work. Moreover, the integration of lexical decision times from the French Lexicon Project into Lexique 4 facilitates more in-depth linguistic research. Enhancements to the user interface, including a redesigned web platform, enable dynamic searches and sorting capabilities, increasing accessibility and usability for researchers. Statistical analyses indicate that the updated frequency measures in Lexique 4 are better predictors of lexical decision times compared to Lexique 3, supporting the value of these enhancements. Overall, Lexique 4 represents a comprehensive and flexible tool for analyzing French lexical properties, making it an essential asset for a broad range of users.

PMID:42030008 | DOI:10.3758/s13428-026-02967-5

By Nevin Manimala

Portfolio Website for Nevin Manimala