A correlation-coefficient method to predicting protein-structural classes from amino acid compositions

Eur J Biochem. 1992 Jul 15;207(2):429-3. doi: 10.1111/j.1432-1033.1992.tb17067.x.

Abstract

A protein is usually classified into one of the following four structural classes: all alpha, all beta, (alpha + beta) and alpha/beta. In this paper, based on the maximum correlation-coefficient principle, a new formulation is proposed for predicting the structural class of a protein according to its amino acid composition. Calculations have been made for a development set of proteins from which the amino acid compositions for the standard structural classes were derived, and an independent set of proteins which are outside the development set. The former can test the self consistency of a method and the latter can test its extrapolating effectiveness. In both cases, the results showed that the new method gave a considerably higher rate of correct prediction than any of the previous methods, implying that a significant improvement has been achieved by implementing the maximum-correlation-coefficient principle in the new method.

MeSH terms

  • Algorithms
  • Amino Acids / chemistry
  • Protein Conformation
  • Proteins / chemistry*
  • Structure-Activity Relationship

Substances

  • Amino Acids
  • Proteins