Phylogenetic trees and Euclidean embeddings

J Math Biol. 2017 Jan;74(1-2):99-111. doi: 10.1007/s00285-016-1018-0. Epub 2016 May 7.

Abstract

It was recently observed by de Vienne et al. (Syst Biol 60(6):826-832, 2011) that a simple square root transformation of distances between taxa on a phylogenetic tree allowed for an embedding of the taxa into Euclidean space. While the justification for this was based on a diffusion model of continuous character evolution along the tree, here we give a direct and elementary explanation for it that provides substantial additional insight. We use this embedding to reinterpret the differences between the NJ and BIONJ tree building algorithms, providing one illustration of how this embedding reflects tree structures in data.

Keywords: Distance methods; Multidimensional scaling; Neighbor joining; Phylogenetic trees.

MeSH terms

  • Algorithms
  • Classification / methods*
  • Models, Genetic*
  • Phylogeny*