Using pseudo-amino acid composition and support vector machine to predict protein structural class

Chao Chen; Yuan-Xin Tian; Xiao-Yong Zou; Pei-Xiang Cai; Jin-Yuan Mo

doi:10.1016/j.jtbi.2006.06.025

Using pseudo-amino acid composition and support vector machine to predict protein structural class

J Theor Biol. 2006 Dec 7;243(3):444-8. doi: 10.1016/j.jtbi.2006.06.025. Epub 2006 Jul 1.

Authors

Chao Chen¹, Yuan-Xin Tian, Xiao-Yong Zou, Pei-Xiang Cai, Jin-Yuan Mo

Affiliation

¹ School of Chemistry and Chemical Engineering, Sun Yat-Sen University, Guangzhou 510275, People's Republic of China.

PMID: 16908032
DOI: 10.1016/j.jtbi.2006.06.025

Abstract

As a result of genome and other sequencing projects, the gap between the number of known protein sequences and the number of known protein structural classes is widening rapidly. In order to narrow this gap, it is vitally important to develop a computational prediction method for fast and accurately determining the protein structural class. In this paper, a novel predictor is developed for predicting protein structural class. It is featured by employing a support vector machine learning system and using a different pseudo-amino acid composition (PseAA), which was introduced to, to some extent, take into account the sequence-order effects to represent protein samples. As a demonstration, the jackknife cross-validation test was performed on a working dataset that contains 204 non-homologous proteins. The predicted results are very encouraging, indicating that the current predictor featured with the PseAA may play an important complementary role to the elegant covariant discriminant predictor and other existing algorithms.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Amino Acid Sequence
Amino Acids / genetics*
Computational Biology
Models, Chemical*
Pattern Recognition, Automated
Protein Structure, Tertiary*
Proteins / chemistry
Proteins / classification*
Proteomics

Substances

Amino Acids
Proteins