Using Protein Design for Homology Detection and Active Site Searches

Proc Natl Acad Sci U S A. 2003 Sep 30;100(20):11361-6. doi: 10.1073/pnas.2034878100. Epub 2003 Sep 15.

Abstract

We describe a method of designing artificial sequences that resemble naturally occurring sequences in terms of their compatibility with a template structure and its functional constraints. The design procedure is a Monte Carlo simulation of amino acid substitution process. The selective fixation of substitutions is dictated by a simple scoring function derived from the template structure and a multiple alignment of its homologs. Designed sequences represent an enlargement of sequence space around native sequences. We show that the use of designed sequences improves the performance of profile-based homology detection. The difference in position-specific conservation between designed sequences and native sequences is helpful for prediction of functionally important residues. Our sequence selection criteria in evolutionary simulations introduce amino acid substitution rate variation among sites in a natural way, providing a better model to test phylogenetic methods.

Publication types

  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Binding Sites
  • Evolution, Molecular
  • Models, Chemical
  • Proteins / chemistry*
  • Proteins / metabolism

Substances

  • Proteins