Plastid genome phylogeny and a model of amino acid substitution for proteins encoded by chloroplast DNA

J Mol Evol. 2000 Apr;50(4):348-58. doi: 10.1007/s002399910038.


Maximum likelihood (ML) phylogenies based on 9,957 amino acid (AA) sites of 45 proteins encoded in the plastid genomes of Cyanophora, a diatom, a rhodophyte (red alga), a euglenophyte, and five land plants are compared with respect to several properties of the data, including between-site rate variation and aberrant amino acid composition in individual species. Neighbor-joining trees from AA LogDet distances and ML analyses are congruent when between-site rate variability is taken into account. Four feasible trees are identified in these analyses, one of which is preferred, and one of which is almost excluded by statistical criteria. A transition probability matrix for the general reversible Markov model of amino acid substitutions is estimated from the data, assuming each of these four trees in turn. In all cases, the tree with diatom and rhodophyte as sister taxa is clearly favored. The new transition matrix based on the best tree, called cpREV, takes into account distinct substitution patterns in plastid-encoded proteins and should be useful in future ML inferences using such data. A second rate matrix, called cpREV*, based on a weighted sum of rate matrices from different trees, is also considered.
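
As an illustration of the general reversible Markov model mentioned in the abstract, the sketch below builds a rate matrix from symmetric exchangeabilities and stationary frequencies, converts it to transition probabilities over a branch of length t, and forms a weighted sum of two rate matrices. It is a minimal sketch under stated assumptions: the three-state alphabet, all numerical values, the weights, and the function names are hypothetical and are not taken from cpREV or cpREV*, which are defined over 20 amino acids and were estimated from the plastid data.

```python
def build_rate_matrix(exch, freqs):
    """Reversible rate matrix: Q[i][j] = exch[i][j] * freqs[j] for i != j."""
    n = len(freqs)
    q = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i != j:
                q[i][j] = exch[i][j] * freqs[j]
        q[i][i] = -sum(q[i])  # diagonal makes each row sum to zero
    # Rescale so that one time unit equals one expected substitution per site.
    rate = -sum(freqs[i] * q[i][i] for i in range(n))
    return [[x / rate for x in row] for row in q]


def mat_mul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def transition_probs(q, t, squarings=10, terms=12):
    """P(t) = exp(Q t), via a Taylor series on Q t / 2**squarings, then squaring."""
    n = len(q)
    scale = t / (2 ** squarings)
    a = [[x * scale for x in row] for row in q]
    result = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    term = [row[:] for row in result]
    for k in range(1, terms + 1):
        term = [[x / k for x in row] for row in mat_mul(term, a)]
        result = [[result[i][j] + term[i][j] for j in range(n)]
                  for i in range(n)]
    for _ in range(squarings):
        result = mat_mul(result, result)
    return result


def weighted_sum(matrices, weights):
    """Entry-wise weighted sum of rate matrices (weights are hypothetical)."""
    n = len(matrices[0])
    return [[sum(w * m[i][j] for m, w in zip(matrices, weights))
             for j in range(n)] for i in range(n)]


# Toy three-state example; real amino acid models use 20 states.
freqs = [0.5, 0.3, 0.2]
exch_a = [[0.0, 1.0, 2.0], [1.0, 0.0, 0.5], [2.0, 0.5, 0.0]]
exch_b = [[0.0, 1.5, 1.0], [1.5, 0.0, 0.8], [1.0, 0.8, 0.0]]

q_a = build_rate_matrix(exch_a, freqs)   # stands in for a matrix from one tree
q_b = build_rate_matrix(exch_b, freqs)   # stands in for a matrix from another
q_mix = weighted_sum([q_a, q_b], [0.7, 0.3])

p = transition_probs(q_a, 0.5)           # transition probabilities at t = 0.5
```

Under reversibility, the resulting probabilities satisfy detailed balance, freqs[i] * p[i][j] = freqs[j] * p[j][i], and each row of p sums to one. The weighted sum here stays reversible only because both toy matrices share the same stationary frequencies; the abstract does not state how the weights for cpREV* were chosen.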

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algal Proteins / chemistry
  • Algal Proteins / genetics*
  • Amino Acid Substitution*
  • DNA, Chloroplast / genetics*
  • Eukaryota / genetics
  • Evolution, Molecular
  • Genome
  • Likelihood Functions
  • Markov Chains
  • Models, Genetic
  • Phylogeny*
  • Plant Proteins / chemistry
  • Plant Proteins / genetics*
  • Plants / genetics
  • Plastids / genetics*


Substances

  • Algal Proteins
  • DNA, Chloroplast
  • Plant Proteins