Predicting protein structural classes with pseudo amino acid composition: an approach using geometric moments of cellular automaton image

J Theor Biol. 2008 Oct 7;254(3):691-6. doi: 10.1016/j.jtbi.2008.06.016. Epub 2008 Jun 24.

Abstract

A novel approach was developed for predicting the structural classes of proteins based on their sequences. It was assumed that proteins belonging to the same structural class must bear some sort of similar texture on the images generated by the cellular automaton evolving rule [Wolfram, S., 1984. Cellular automation as models of complexity. Nature 311, 419-424]. Based on this, two geometric invariant moment factors derived from the image functions were used as the pseudo amino acid components [Chou, K.C., 2001. Prediction of protein cellular attributes using pseudo amino acid composition. Proteins: Struct., Funct., Genet. (Erratum: ibid., 2001, vol. 44, 60) 43, 246-255] to formulate the protein samples for statistical prediction. The success rates thus obtained on a previously constructed benchmark dataset are quite promising, implying that the cellular automaton image can help to reveal some inherent and subtle features deeply hidden in a pile of long and complicated amino acid sequences.

MeSH terms

  • Algorithms
  • Amino Acid Sequence*
  • Databases, Protein
  • Image Processing, Computer-Assisted / methods
  • Protein Folding
  • Proteins / chemistry
  • Proteins / classification*

Substances

  • Proteins