PASSML: combining evolutionary inference and protein secondary structure prediction

Bioinformatics. 1998;14(8):726-33. doi: 10.1093/bioinformatics/14.8.726.

Abstract

Motivation: Evolutionary models of amino acid sequences can be adapted to incorporate structure information; protein structure biologists can use phylogenetic relationships among species to improve prediction accuracy. Results : A computer program called PASSML ('Phylogeny and Secondary Structure using Maximum Likelihood') has been developed to implement an evolutionary model that combines protein secondary structure and amino acid replacement. The model is related to that of Dayhoff and co-workers, but we distinguish eight categories of structural environment: alpha helix, beta sheet, turn and coil, each further classified according to solvent accessibility, i.e. buried or exposed. The model of sequence evolution for each of the eight categories is a Markov process with discrete states in continuous time, and the organization of structure along protein sequences is described by a hidden Markov model. This paper describes the PASSML software and illustrates how it allows both the reconstruction of phylogenies and prediction of secondary structure from aligned amino acid sequences.

Availability: PASSML 'ANSI C' source code and the example data sets described here are available at http://ng-dec1.gen.cam.ac.uk/hmm/Passml.html and 'downstream' Web pages.

Contact: P.Lio@gen.cam.ac.uk

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Adenylate Kinase / genetics
  • Algorithms
  • Amino Acid Sequence
  • Animals
  • Cattle
  • Databases, Factual
  • Escherichia coli / genetics
  • Evolution, Molecular*
  • Humans
  • Neisseria meningitidis / genetics
  • Phylogeny
  • Protein Structure, Secondary*
  • Rats
  • Saccharomyces cerevisiae / genetics
  • Schizosaccharomyces / genetics
  • Sequence Alignment
  • Software*

Substances

  • Adenylate Kinase