Assessing the performance of fold recognition methods by means of a comprehensive benchmark

Pac Symp Biocomput. 1996:300-18.

Abstract

Recently there has been an explosion of methods for fold recognition. These methods seek to align a protein sequence to a three-dimensional structure and measure the compatibility of the sequence to the structure. In this work, we present a benchmark to assess the performance of such methods. The benchmark consists of a set of protein sequences matched by superposition to known structures. This set covers a wide range of protein families, and includes matching proteins with insignificant sequence similarity. To demonstrate the usefulness of this benchmark, we apply it here to compare different fold-recognition methods developed through the years in our group as well as several sequence-sequence substitution matrices. The results show that "global-local" alignments are superior to either local or global alignments. The most effective sequence-sequence matching matrix is the Gonnet table. The best performance overall is obtained by a method which combines the 3D-1D profiles of Bowie et al. with a substitution matrix and takes into account residue pairwise interactions.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Amino Acid Sequence*
  • Computer Simulation
  • Enzymes / chemistry
  • Molecular Biology / methods
  • Protein Folding*
  • Protein Structure, Secondary*
  • Proteins / chemistry*
  • Sensitivity and Specificity
  • Sequence Alignment
  • Software

Substances

  • Enzymes
  • Proteins