Bayesian Inference of Reticulate Phylogenies under the Multispecies Network Coalescent

PLoS Genet. 2016 May 4;12(5):e1006006. doi: 10.1371/journal.pgen.1006006. eCollection 2016 May.

Abstract

The multispecies coalescent (MSC) is a statistical framework that models how gene genealogies grow within the branches of a species tree. The field of computational phylogenetics has witnessed an explosion in the development of methods for species tree inference under MSC, owing mainly to the accumulating evidence of incomplete lineage sorting in phylogenomic analyses. However, the evolutionary history of a set of genomes, or species, could be reticulate due to the occurrence of evolutionary processes such as hybridization or horizontal gene transfer. We report on a novel method for Bayesian inference of genome and species phylogenies under the multispecies network coalescent (MSNC). This framework models gene evolution within the branches of a phylogenetic network, thus incorporating reticulate evolutionary processes, such as hybridization, in addition to incomplete lineage sorting. As phylogenetic networks with different numbers of reticulation events correspond to points of different dimensions in the space of models, we devise a reversible-jump Markov chain Monte Carlo (RJMCMC) technique for sampling the posterior distribution of phylogenetic networks under MSNC. We implemented the methods in the publicly available, open-source software package PhyloNet and studied their performance on simulated and biological data. The work extends the reach of Bayesian inference to phylogenetic networks and enables new evolutionary analyses that account for reticulation.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Bayes Theorem
  • Computational Biology / methods
  • Evolution, Molecular*
  • Gene Transfer, Horizontal
  • Genome
  • Markov Chains
  • Models, Genetic*
  • Phylogeny*

Associated data

  • Dryad/10.5061/dryad.n2606

Grants and funding

This work was supported in part by National Science Foundation (http://www.nsf.gov) grants CCF-1302179 and DBI-1355998. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.