Predicting protein structural classes with pseudo amino acid composition: an approach using geometric moments of cellular automaton image

Xuan Xiao; Pu Wang; Kuo-Chen Chou

doi:10.1016/j.jtbi.2008.06.016

J Theor Biol. 2008 Oct 7;254(3):691-6. doi: 10.1016/j.jtbi.2008.06.016. Epub 2008 Jun 24.

Authors

Xuan Xiao¹, Pu Wang, Kuo-Chen Chou

Affiliation

¹ Computer Department, Jing-De-Zhen Ceramic Institute, Jing-De-Zhen 33300, China. xiaoxuan0326@yahoo.com.cn

PMID: 18634802

Abstract

A novel approach was developed for predicting the structural classes of proteins from their sequences. It was assumed that proteins belonging to the same structural class should share a similar texture in the images generated by the cellular automaton evolving rule [Wolfram, S., 1984. Cellular automata as models of complexity. Nature 311, 419-424]. Based on this assumption, two geometric invariant moment factors derived from the image functions were used as the pseudo amino acid components [Chou, K.C., 2001. Prediction of protein cellular attributes using pseudo amino acid composition. Proteins: Struct., Funct., Genet. 43, 246-255 (Erratum: ibid., 2001, vol. 44, 60)] to formulate the protein samples for statistical prediction. The success rates thus obtained on a previously constructed benchmark dataset are quite promising, implying that the cellular automaton image can help reveal inherent and subtle features hidden in long and complicated amino acid sequences.
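
The pipeline described in the abstract (sequence, then cellular automaton image, then two invariant moments appended to the amino acid composition) can be sketched roughly as follows. This is an illustration under stated assumptions, not the authors' implementation: the abstract does not give the residue coding, the automaton rule, the number of evolution steps, or which moment invariants were used. The 5-bit coding in `encode`, the default `rule=84` and `steps=50`, and the choice of the first two Hu invariants are all hypothetical stand-ins.

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues


def encode(seq):
    """Map each residue to 5 bits (ASSUMED coding: binary index in AMINO_ACIDS)."""
    bits = []
    for aa in seq:
        idx = AMINO_ACIDS.index(aa)  # raises ValueError on non-standard residues
        bits.extend(int(b) for b in format(idx, "05b"))
    return bits


def evolve(row, rule=84, steps=50):
    """Run an elementary cellular automaton with periodic boundaries.

    Returns the space-time image as a list of rows (row 0 is the input).
    The rule number and step count are ASSUMED defaults.
    """
    table = [(rule >> i) & 1 for i in range(8)]  # Wolfram rule lookup table
    n = len(row)
    image = [list(row)]
    for _ in range(steps):
        prev = image[-1]
        image.append([
            table[(prev[(i - 1) % n] << 2) | (prev[i] << 1) | prev[(i + 1) % n]]
            for i in range(n)
        ])
    return image


def invariant_moments(image):
    """First two Hu moment invariants of a binary image (ASSUMED choice)."""
    m00 = sum(sum(r) for r in image)
    if m00 == 0:
        return 0.0, 0.0  # empty image: no texture to describe
    xbar = sum(x * v for r in image for x, v in enumerate(r)) / m00
    ybar = sum(y * v for y, r in enumerate(image) for v in r) / m00

    def eta(p, q):
        # Central moment, normalised for scale invariance.
        mu = sum(((x - xbar) ** p) * ((y - ybar) ** q) * v
                 for y, r in enumerate(image) for x, v in enumerate(r))
        return mu / m00 ** (1 + (p + q) / 2)

    e20, e02, e11 = eta(2, 0), eta(0, 2), eta(1, 1)
    return e20 + e02, (e20 - e02) ** 2 + 4 * e11 ** 2


def pse_aac(seq, rule=84, steps=50):
    """20 composition frequencies followed by the 2 moment factors."""
    counts = Counter(seq)
    composition = [counts[a] / len(seq) for a in AMINO_ACIDS]
    phi1, phi2 = invariant_moments(evolve(encode(seq), rule, steps))
    return composition + [phi1, phi2]
```

Calling `pse_aac("MKVLAAGIVGLLLAQ")` returns a 22-component vector that could be passed to any statistical classifier. In practice the moment factors would likely need rescaling so that they are comparable in magnitude to the composition frequencies.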

MeSH terms

Algorithms
Amino Acid Sequence*
Databases, Protein
Image Processing, Computer-Assisted / methods
Protein Folding
Proteins / chemistry
Proteins / classification*

Substances

Proteins