Identification and analysis of deleterious human SNPs

J Mol Biol. 2006 Mar 10;356(5):1263-74. doi: 10.1016/j.jmb.2005.12.025. Epub 2005 Dec 27.

Abstract

We have developed two methods of identifying which non-synonymous single base changes have a deleterious effect on protein function in vivo. One method, described elsewhere, analyzes the effect of the resulting amino acid change on protein stability, utilizing structural information. The other method, introduced here, makes use of the conservation and type of residues observed at a base change position within a protein family. A machine learning technique, the support vector machine, is trained on single amino acid changes that cause monogenic disease, with a control set of amino acid changes fixed between species. Both methods are used to identify deleterious single nucleotide polymorphisms (SNPs) in the human population. After carefully controlling for errors, we find that approximately one quarter of known non-synonymous SNPs are deleterious by these criteria, providing a set of possible contributors to human complex disease traits.
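
As a rough illustration of the kind of conservation-based input the second method describes, the sketch below computes simple per-position features from one column of a protein family alignment. The abstract does not specify the features actually used, so the function names, the choice of Shannon entropy, and the example columns are all assumptions made for illustration, not the authors' implementation.

```python
import math
from collections import Counter


def column_entropy(column):
    """Shannon entropy (in bits) of the residues in one alignment column.

    Gap characters ('-') are ignored. Low entropy means the position is
    highly conserved across the family. (Hypothetical helper.)
    """
    residues = [r for r in column if r != "-"]
    if not residues:
        return 0.0
    total = len(residues)
    counts = Counter(residues)
    return sum((n / total) * math.log2(total / n) for n in counts.values())


def position_features(column, variant_residue):
    """Assumed feature set for one substitution at one alignment position."""
    residues = [r for r in column if r != "-"]
    total = len(residues)
    counts = Counter(residues)
    variant_count = counts[variant_residue]
    return {
        # How variable the position is across the family.
        "entropy": column_entropy(column),
        # How often the substituted residue appears at this position.
        "variant_frequency": variant_count / total if total else 0.0,
        # Whether the substituted residue is ever seen at this position.
        "variant_observed": variant_count > 0,
    }


# Made-up alignment columns, one character per family member.
conserved_column = "LLLLLLLL"
variable_column = "LIVMLIVA"

print(position_features(conserved_column, "P"))
print(position_features(variable_column, "V"))
```

Feature vectors of this kind, computed for disease-causing changes and for changes fixed between species, are the sort of labeled input a support vector machine could be trained on. A substitution to a residue never observed at a fully conserved position would be expected to look more like the disease set than one to a residue already common in the family.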

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Animals
  • DNA Mutational Analysis*
  • Databases, Protein
  • Evolution, Molecular
  • Genetic Predisposition to Disease
  • Genome
  • Humans
  • Mice
  • Mice, Knockout
  • Polymorphism, Single Nucleotide*
  • Proteins / chemistry
  • Proteins / genetics
  • Sensitivity and Specificity

Substances

  • Proteins