Using artificially generated spectral data to improve protein secondary structure prediction from Fourier transform infrared spectra of proteins

Anal Biochem. 2004 Sep 15;332(2):238-44. doi: 10.1016/j.ab.2004.06.030.

Abstract

Secondary structures of proteins have been predicted using neural networks from their Fourier transform infrared spectra. To improve the generalization ability of the neural networks, the training data set has been artificially increased by linear interpolation. The leave-one-out approach has been used to demonstrate the applicability of the method. Bayesian regularization has been used to train the neural networks and the predictions have been further improved by the maximum-likelihood estimation method. The networks have been tested and standard error of prediction (SEP) of 4.19% for alpha helix, 3.49% for beta sheet, and 3.15% for turns have been achieved. The results indicate that there is a significant decrease in the SEP for each type of structure parameter compared to previous works.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Computer Simulation*
  • Neural Networks, Computer
  • Protein Structure, Secondary
  • Proteins / chemistry*
  • Sensitivity and Specificity
  • Spectroscopy, Fourier Transform Infrared

Substances

  • Proteins