The Persian Lexicon Project: minimized orthographic neighbourhood effects in a dense language

J Psycholinguist Res. 2022 Oct;51(5):957-979. doi: 10.1007/s10936-022-09863-x. Epub 2022 Apr 2.

Abstract

In recent years large datasets of lexical processing times have been released for several languages, including English, French, Spanish, and Dutch. Such datasets have enabled us to study, compare, and model the global effects of many psycholinguistic measures such as word frequency, orthographic neighborhood (ON) size, and word length. We have compiled and publicly released a frequency and ON dictionary of 64,546 words and 1800 plausible NWs from a language that has been relatively little studied by psycholinguists: Persian. We have also collected visual lexical decision reaction times for 1800 Persian words and nonwords. Persian offers an interesting psycholinguistic environment for several reasons, including that it has few long words and has resultantly dense orthographic neighborhoods. These characteristics provide us with an opportunity to contrast how these factors affect lexical access by comparing them to several other languages. The results suggest that sensitivity to word length and orthographic neighbourhood may reflect the statistical structure of a particular language, rather than being a universal element of lexical processing. The dictionary and LDRT data are available from https://osf.io/tb4m6/ .

Keywords: Farsi dictionary; Lexical decision; Orthographic neighborhood; Persian dictionary; Word frequency.

MeSH terms

  • Humans
  • Language*
  • Psycholinguistics*
  • Reaction Time