STRUCTFAST: protein sequence remote homology detection and alignment using novel dynamic programming and profile-profile scoring

Proteins. 2006 Sep 1;64(4):960-7. doi: 10.1002/prot.21049.

Abstract

STRUCTFAST is a novel profile-profile alignment algorithm capable of detecting weak similarities between protein sequences. The increased sensitivity and accuracy of the STRUCTFAST method are achieved through several unique features. First, the algorithm utilizes a novel dynamic programming engine capable of incorporating important information from a structural family directly into the alignment process. Second, the algorithm employs a rigorous analytical formula for profile-profile scoring to overcome the limitations of ad hoc scoring functions that require adjustable parameter training. Third, the algorithm employs Convergent Island Statistics (CIS) to compute the statistical significance of alignment scores independently for each pair of sequences. STRUCTFAST routinely produces alignments that meet or exceed the quality obtained by an expert human homology modeler, as evidenced by its performance in the latest CAFASP4 and CASP6 blind prediction benchmark experiments.

MeSH terms

  • Algorithms
  • Proteins / chemistry*
  • Sequence Alignment / methods*
  • Sequence Homology, Amino Acid*
  • Software

Substances

  • Proteins