Functional annotations improve the predictive score of human disease-related mutations in proteins

Hum Mutat. 2009 Aug;30(8):1237-44. doi: 10.1002/humu.21047.

Abstract

Single nucleotide polymorphisms (SNPs) are the simplest and most frequent form of human DNA variation, also valuable as genetic markers of disease susceptibility. The most investigated SNPs are missense mutations resulting in residue substitutions in the protein. Here we propose SNPs&GO, an accurate method that, starting from a protein sequence, can predict whether a mutation is disease related or not by exploiting the protein functional annotation. The scoring efficiency of SNPs&GO is as high as 82%, with a Matthews correlation coefficient equal to 0.63 over a wide set of annotated nonsynonymous mutations in proteins, including 16,330 disease-related and 17,432 neutral polymorphisms. SNPs&GO collects in unique framework information derived from protein sequence, evolutionary information, and function as encoded in the Gene Ontology terms, and outperforms other available predictive methods.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Abnormalities, Multiple / genetics*
  • Humans
  • Mutation*
  • Proteins / genetics*
  • Syndrome

Substances

  • Proteins