ProRegIn: a regularity index for the selection of native-like tertiary structures of proteins

J Biosci. 2007 Jan;32(1):71-81. doi: 10.1007/s12038-007-0007-2.

Abstract

Automated protein tertiary structure prediction from sequence information alone remains an elusive goal to computational prescriptions. Dividing the problem into three stages viz. secondary structure prediction, generation of plausible main chain loop dihedrals and side chain dihedral optimization, considerable progress has been achieved in our laboratory (http://www.scfbio-iitd.res.in/bhageerath/index.jsp) and elsewhere for proteins with less than 100 amino acids. As a part of our on-going efforts in this direction and to facilitate tertiary structure selection/rejection in containing the combinatorial explosion of trial structures for a specified amino acid sequence, we describe here a web-enabled tool ProRegIn (Protein Regularity Index) developed based on the regularity in the Phi, Psi dihedral angles of the amino acids that constitute loop regions. We have analysed the dihedrals in loop regions in a non-redundant dataset of 7351 proteins drawn from the Protein Data Bank and categorized them as helix-like or sheet-like (regular) or irregular. We noticed that the regularity thus defined exceeds 86% for Phi barring glycine and 70% for Psi for all the amino acid side chains including glycine, compelling us to reexamine the conventional view that loops are irregular regions structurally. The regularity index is presented here as a simple tool that finds its application in protein structure analysis as a discriminatory scoring function for rapid screening before the more compute intensive atomic level energy calculations could be undertaken. The tool is made freely accessible over the internet at www.scfbio-iitd.res.in/software/proregin.jsp.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acids / chemistry
  • Databases, Protein
  • Internet
  • Protein Structure, Secondary
  • Protein Structure, Tertiary*
  • Proteins / chemistry*
  • Software*

Substances

  • Amino Acids
  • Proteins