Integrated likelihood for phylogenomics under a no-common-mechanism model

BMC Genomics. 2020 Apr 16;21(Suppl 2):219. doi: 10.1186/s12864-020-6608-y.

Abstract

Background: Multi-locus species phylogeny inference is based on models of sequence evolution on gene trees as well as models of gene tree evolution within the branches of species phylogenies. Almost all statistical methods for this inference task assume a common mechanism across all loci as captured by a single value of each branch length of the species phylogeny.

Results: In this paper, we pursue a "no common mechanism" (NCM) model, where every gene tree evolves according to its own parameters of the species phylogeny. Based on this model, we derive an analytically integrated likelihood of both species trees and networks given the gene trees of multiple loci under an NCM model. We demonstrate the performance of inference under this integrated likelihood on both simulated and biological data.

Conclusions: The model presented here will afford opportunities for exploring connections among various criteria for estimating species phylogenies from multiple, independent loci. Furthermore, further development of this model could potentially result in more efficient methods for searching the space of species phylogenies by focusing solely on the topology of the phylogeny.

Keywords: Integrated likelihood; Multispecies coalescent; No common mechanism; Phylogenomics.

MeSH terms

  • Animals
  • Computer Simulation
  • Culicidae / genetics
  • Evolution, Molecular*
  • Genetic Speciation
  • Genomics / methods*
  • Likelihood Functions
  • Models, Genetic
  • Neural Networks, Computer
  • Phylogeny
  • Probability
  • Wills / statistics & numerical data