Bridging the data gap between children and large language models

Trends Cogn Sci. 2023 Nov;27(11):990-992. doi: 10.1016/j.tics.2023.08.007. Epub 2023 Aug 31.

Abstract

Large language models (LLMs) show intriguing emergent behaviors, yet they receive four to five orders of magnitude more language data than human children do. What accounts for this vast difference in sample efficiency? Candidate explanations include children's pre-existing conceptual knowledge, their use of multimodal grounding, and the interactive, social nature of their input.
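To make the scale of that gap concrete, here is a minimal back-of-the-envelope sketch in Python. The word and token counts below are illustrative assumptions chosen to be consistent with the abstract's four-to-five-orders-of-magnitude claim, not figures taken from the paper.

    import math

    # Assumed illustrative inputs: a child hearing roughly 10 million words
    # per year for 10 years, versus LLM training corpora of ~1e12 to ~1e13
    # tokens. Both numbers are assumptions for illustration only.
    child_words = 10_000_000 * 10        # ~1e8 words heard by early adolescence
    llm_corpus_sizes = (1e12, 1e13)      # assumed LLM training-set sizes

    for llm_tokens in llm_corpus_sizes:
        gap = math.log10(llm_tokens / child_words)
        print(f"{llm_tokens:.0e} training tokens -> "
              f"~{gap:.0f} orders of magnitude more data than the child")

Under these assumed inputs, the gap works out to four or five orders of magnitude, matching the range stated above.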

Keywords: artificial intelligence; human learning; language learning; large language models.