DISSECT: an assignment-free Bayesian discovery method for species delimitation under the multispecies coalescent

Bioinformatics. 2015 Apr 1;31(7):991-8. doi: 10.1093/bioinformatics/btu770. Epub 2014 Nov 23.

Abstract

Motivation: The multispecies coalescent model provides a formal framework for the assignment of individual organisms to species, where the species are modeled as the branches of the sp tree. None of the available approaches so far have simultaneously co-estimated all the relevant parameters in the model, without restricting the parameter space by requiring a guide tree and/or prior assignment of individuals to clusters or species.

Results: We present DISSECT, which explores the full space of possible clusterings of individuals and species tree topologies in a Bayesian framework. It uses an approximation to avoid the need for reversible-jump Markov Chain Monte Carlo, in the form of a prior that is a modification of the birth-death prior for the species tree. It incorporates a spike near zero in the density for node heights. The model has two extra parameters: one controls the degree of approximation and the second controls the prior distribution on the numbers of species. It is implemented as part of BEAST and requires only a few changes from a standard *BEAST analysis. The method is evaluated on simulated data and demonstrated on an empirical dataset. The method is shown to be insensitive to the degree of approximation, but quite sensitive to the second parameter, suggesting that large numbers of sequences are needed to draw firm conclusions.

Availability and implementation: http://tree.bio.ed.ac.uk/software/beast/, http://www.indriid.com/dissectinbeast.html.

Supplementary information: Supplementary data are available at Bioinformatics online.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Animals
  • Bayes Theorem*
  • Computational Biology / methods*
  • Computer Simulation
  • Genetic Speciation*
  • Gophers / genetics*
  • Markov Chains
  • Monte Carlo Method
  • Phylogeny*
  • Silene / genetics*