Protein database searches for multiple alignments

Proc Natl Acad Sci U S A. 1990 Jul;87(14):5509-13. doi: 10.1073/pnas.87.14.5509.

Abstract

Protein database searches frequently can reveal biologically significant sequence relationships useful in understanding structure and function. Weak but meaningful sequence patterns can be obscured, however, by other similarities due only to chance. By searching a database for multiple as opposed to pairwise alignments, distant relationships are much more easily distinguished from background noise. Recent statistical results permit the power of this approach to be analyzed. Given a typical query sequence, an algorithm described here permits the current protein database to be searched for three-sequence alignments in less than 4 min. Such searches have revealed a variety of subtle relationships that pairwise search methods would be unable to detect.

Publication types

  • Comparative Study

MeSH terms

  • Algorithms
  • Amino Acid Sequence*
  • Escherichia coli / genetics
  • Humans
  • Information Systems*
  • Molecular Sequence Data
  • Proteins / genetics*
  • Retroviridae / genetics
  • Sequence Homology, Nucleic Acid

Substances

  • Proteins