Computational analysis of evolution and conservation in a protein superfamily

Methods. 2004 Feb;32(2):73-92. doi: 10.1016/s1046-2023(03)00200-7.

Abstract

Many gene superfamilies have hundreds or thousands of members and hence pose a significant challenge when performing a large-scale phylogenetic analysis. Derivation of the most accurate alignment possible and inference of evolutionary relationships (with an appropriate measure of confidence) are significant "bottlenecks" in the process. A generally applicable strategy is outlined for identifying and aligning sequences, performing simple analysis of the resulting alignment, and inferring evolutionary relationships. Reference is made to the serpin superfamily. The 'partition cluster' method, a relatively rapid technique for extracting underlying associations from phylogenetic bootstrap trees, is also presented.

MeSH terms

  • Computational Biology / methods*
  • Conserved Sequence / genetics
  • Databases, Nucleic Acid
  • Databases, Protein
  • Evolution, Molecular*
  • Imaging, Three-Dimensional
  • Mutation / genetics
  • Phylogeny
  • Protein Structure, Secondary
  • Sequence Alignment / methods
  • Sequence Homology, Amino Acid
  • Sequence Homology, Nucleic Acid
  • Serpins / chemistry
  • Serpins / genetics*
  • Software
  • Software Design
  • Structural Homology, Protein

Substances

  • Serpins