TuLeD (Tupían lexical database): introducing a database of a South American language family

Lang Resour Eval. 2021;55(4):997-1015. doi: 10.1007/s10579-020-09521-5. Epub 2021 Jan 13.

Abstract

The last two decades witnessed a rapid growth of publicly accessible online language resources. This has allowed for valuable data on lesser known languages to become available. Such resources provide linguists with opportunities for advancing their research. Yet despite the proliferation of lexical and morphological databases, the ca. 456 languages spoken in South America are poorly represented, particularly the Tupían family, which is the largest on the continent. This paper therefore introduces and discusses TuLeD, a lexical database exclusively devoted to a South American language family. It provides a comprehensive list of lexical items presented in a unified transcription for all languages with cognacy assignment and relevant (cultural or linguistic) notes. One of the main goals of TuLeD is to become a full-fledged database and a benchmark for linguistic studies on South American languages in general and the Tupían family in particular.

Keywords: Lexical database; Linguistics; South American languages; Tupí-Guaraní; Tupían.