Detecting regular sound changes in linguistics as events of concerted evolution

Daniel J Hruschka; Simon Branford; Eric D Smith; Jon Wilkins; Andrew Meade; Mark Pagel; Tanmoy Bhattacharya

doi:10.1016/j.cub.2014.10.064

Detecting regular sound changes in linguistics as events of concerted evolution

Curr Biol. 2015 Jan 5;25(1):1-9. doi: 10.1016/j.cub.2014.10.064. Epub 2014 Dec 18.

Authors

Daniel J Hruschka¹, Simon Branford², Eric D Smith³, Jon Wilkins⁴, Andrew Meade², Mark Pagel⁵, Tanmoy Bhattacharya⁶

Affiliations

¹ School of Human Evolution and Social Change, Arizona State University, PO Box 872402, Tempe, AZ 85287-2402, USA.
² School of Biological Sciences, University of Reading, Reading RG6 6BX, UK.
³ The Santa Fe Institute, 1399 Hyde Park Road, Santa Fe, NM 87501, USA; Krasnow Institute for Advanced Study, George Mason University, Mail Stop 2A1, 4400 University Drive, Fairfax, VA 22030, USA.
⁴ The Santa Fe Institute, 1399 Hyde Park Road, Santa Fe, NM 87501, USA; Ronin Institute, 127 Haddon Place, Montclair, NJ 07043, USA.
⁵ School of Biological Sciences, University of Reading, Reading RG6 6BX, UK; The Santa Fe Institute, 1399 Hyde Park Road, Santa Fe, NM 87501, USA. Electronic address: m.pagel@reading.ac.uk.
⁶ The Santa Fe Institute, 1399 Hyde Park Road, Santa Fe, NM 87501, USA; T-2, Los Alamos National Laboratory, Los Alamos, NM 87545, USA. Electronic address: tanmoy@santafe.edu.

Abstract

Background: Concerted evolution is normally used to describe parallel changes at different sites in a genome, but it is also observed in languages where a specific phoneme changes to the same other phoneme in many words in the lexicon—a phenomenon known as regular sound change. We develop a general statistical model that can detect concerted changes in aligned sequence data and apply it to study regular sound changes in the Turkic language family.

Results: Linguistic evolution, unlike the genetic substitutional process, is dominated by events of concerted evolutionary change. Our model identified more than 70 historical events of regular sound change that occurred throughout the evolution of the Turkic language family, while simultaneously inferring a dated phylogenetic tree. Including regular sound changes yielded an approximately 4-fold improvement in the characterization of linguistic change over a simpler model of sporadic change, improved phylogenetic inference, and returned more reliable and plausible dates for events on the phylogenies. The historical timings of the concerted changes closely follow a Poisson process model, and the sound transition networks derived from our model mirror linguistic expectations.

Conclusions: We demonstrate that a model with no prior knowledge of complex concerted or regular changes can nevertheless infer the historical timings and genealogical placements of events of concerted change from the signals left in contemporary data. Our model can be applied wherever discrete elements—such as genes, words, cultural trends, technologies, or morphological traits—can change in parallel within an organism or other evolving group.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Cultural Evolution*
Humans
Models, Statistical
Phonetics*
Phylogeny

Grants and funding

268744/ERC_/European Research Council/International