Predicting protein secondary structure with a nearest-neighbor algorithm

S Salzberg; S Cost

doi:10.1016/0022-2836(92)90892-n

J Mol Biol. 1992 Sep 20;227(2):371-4. doi: 10.1016/0022-2836(92)90892-n.

Authors

S Salzberg¹, S Cost

Affiliation

¹ Department of Computer Science, Johns Hopkins University, Baltimore, MD 21218.

PMID: 1404357

Abstract

We have developed a new method for protein secondary structure prediction that achieves accuracies as high as 71.0%, the highest value yet reported. The main component of our method is a nearest-neighbor algorithm that uses a more sophisticated treatment of the feature space than standard nearest-neighbor methods. It calculates distance tables that allow it to produce real-valued distances between amino acid residues, and attaches weights to the instances to further modify the structure of the feature space. The algorithm, which is closely related to the memory-based reasoning method of Zhang et al., is simple and easy to train, and has also been applied with excellent results to the problem of identifying DNA promoter sequences.
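The idea of table-based distances and instance weights can be illustrated with a minimal Python sketch. The abstract does not give the distance formula or the weighting scheme, so everything here is an assumption made for illustration: the table is built with a value-difference style measure (the summed absolute difference between the class frequencies observed for two residues at a window position), the instance weight simply multiplies the distance, and the function names, the fallback distance for unseen residues, and the toy windows are all hypothetical.

```python
from collections import Counter, defaultdict

UNSEEN_DISTANCE = 2.0  # assumption: maximum possible value of the measure below


def build_distance_table(windows, labels, position):
    """Build a residue-to-residue distance table for one window position.

    Assumed measure: sum over classes of |P(class | a) - P(class | b)|,
    estimated from the training windows. This is not taken from the abstract.
    """
    counts = defaultdict(Counter)  # residue -> counts of class labels
    for window, label in zip(windows, labels):
        counts[window[position]][label] += 1
    classes = sorted(set(labels))
    table = {}
    for a, ca in counts.items():
        na = sum(ca.values())
        for b, cb in counts.items():
            nb = sum(cb.values())
            table[(a, b)] = sum(abs(ca[c] / na - cb[c] / nb) for c in classes)
    return table


def distance(query, instance, tables, weight=1.0):
    """Weighted sum of per-position table distances between two windows."""
    total = sum(
        table.get((q, r), UNSEEN_DISTANCE)
        for q, r, table in zip(query, instance, tables)
    )
    return weight * total


def predict(query, windows, labels, tables, weights=None):
    """Return the label of the nearest stored window (1-nearest-neighbor)."""
    if weights is None:
        weights = [1.0] * len(windows)
    best = min(
        range(len(windows)),
        key=lambda i: distance(query, windows[i], tables, weights[i]),
    )
    return labels[best]


# Made-up toy data: three-residue windows labelled helix (H) or coil (C).
train_windows = ["AAA", "AAG", "GGG", "GGA"]
train_labels = ["H", "H", "C", "C"]
tables = [build_distance_table(train_windows, train_labels, p) for p in range(3)]

print(predict("AAA", train_windows, train_labels, tables))
```

In this sketch a larger weight pushes an instance further from every query, so instances judged unreliable during training would be matched less often. How the published method actually sets its weights is not stated in the abstract.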

Publication types

Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

Algorithms*
Amino Acid Sequence
Molecular Sequence Data
Protein Structure, Secondary*