A vector projection method for predicting the specificity of GalNAc-transferase

Proteins. 1995 Feb;21(2):118-26. doi: 10.1002/prot.340210205.


The specificity of UDP-GalNAc:polypeptide N-acetylgalactosaminytransferase (GalNAc-transferase) is consistent with the existence of an extended site composed of nine subsites, denoted by P4, P3, P2, P1, P0, P1', P2', P3', P4', where the acceptor at P0 is being either Ser or Thr. To predict whether a peptide will react with the enzyme to form a Ser- or Thr-conjugated glycopeptide, a vector projection method is proposed which uses a training set of amino acid sequences surrounding 90 Ser and 106 Thr O-glycosylation sites extracted from the National Biomedical Research Foundation Protein Database. The model postulates independent interactions of the 9 amino acid moieties with their respective binding sites. The high ratio of correct predictions vs. total predictions for the data in both the training and the testing sets indicates that the method is self-consistent and efficient. It provides a rapid means for predicting O-glycosylation and designing effective inhibitors of GalNAc-transferase.

MeSH terms

  • Amino Acid Sequence
  • Animals
  • Binding Sites / genetics
  • Glycosylation
  • Humans
  • In Vitro Techniques
  • Molecular Sequence Data
  • N-Acetylgalactosaminyltransferases / chemistry
  • N-Acetylgalactosaminyltransferases / genetics
  • N-Acetylgalactosaminyltransferases / metabolism*
  • Peptide Fragments / chemistry
  • Peptide Fragments / genetics
  • Peptide Fragments / metabolism
  • Serine / chemistry
  • Threonine / chemistry


  • Peptide Fragments
  • Threonine
  • Serine
  • N-Acetylgalactosaminyltransferases