Predicting protein secondary structure with a nearest-neighbor algorithm

J Mol Biol. 1992 Sep 20;227(2):371-4. doi: 10.1016/0022-2836(92)90892-n.

Abstract

We have developed a new method for protein secondary structure prediction that achieves accuracies as high as 71.0%, the highest value yet reported. The main component of our method is a nearest-neighbor algorithm that uses a more sophisticated treatment of the feature space than standard nearest-neighbor methods. It calculates distance tables that allow it to produce real-valued distances between amino acid residues, and attaches weights to the instances to further modify the the structure of feature space. The algorithm, which is closely related to the memory-based reasoning method of Zhang et al., is simple and easy to train, and has also been applied with excellent results to the problem of identifying DNA promoter sequences.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Algorithms*
  • Amino Acid Sequence
  • Molecular Sequence Data
  • Protein Structure, Secondary*