EsPal: one-stop shopping for Spanish word properties

Behav Res Methods. 2013 Dec;45(4):1246-58. doi: 10.3758/s13428-013-0326-1.

Abstract

This article introduces EsPal: a Web-accessible repository containing a comprehensive set of properties of Spanish words. EsPal is based on an extensible set of data sources, beginning with a 300 million token written database and a 460 million token subtitle database. Properties available include word frequency, orthographic structure and neighborhoods, phonological structure and neighborhoods, and subjective ratings such as imageability. Subword structure properties are also available in terms of bigrams and trigrams, biphones, and bisyllables. Lemma and part-of-speech information and their corresponding frequencies are also indexed. The website enables users either to upload a set of words to receive their properties or to receive a set of words matching constraints on the properties. The properties themselves are easily extensible and will be added over time as they become available. It is freely available from the following website: http://www.bcbl.eu/databases/espal/ .

Publication types

  • Research Support, Non-U.S. Gov't
  • Validation Study

MeSH terms

  • Data Display
  • Databases, Factual*
  • Humans
  • Internet
  • Language*
  • Natural Language Processing*
  • Pattern Recognition, Visual
  • Phonetics*
  • Spain
  • Speech
  • Speech Recognition Software
  • Vocabulary*