Predicting the functional consequences of non-synonymous single nucleotide polymorphisms: structure-based assessment of amino acid variation

J Mol Biol. 2001 Mar 23;307(2):683-706. doi: 10.1006/jmbi.2001.4510.

Abstract

We have developed a formalism and a computational method for analyzing the potential functional consequences of non-synonymous single nucleotide polymorphisms. Our approach uses a structural model and phylogenetic information to derive a selection of structure and sequence-based features serving as indicators of an amino acid polymorphim's effect on function. The feature values can be integrated into a probabilistic assessment of whether an amino acid polymorphism will affect the function or stability of a target protein. The method has been validated with data sets of unbiased mutations in the lac repressor and lysoyzyme. Applying our methodology to recent surveys of genetic variation in the coding regions of clinically important genes, we estimate that approximately 26-32 % of the natural non-synonymous single nucleotide polymorphisms have effects on function. This estimate suggests that a typical person will have about 6240-12,800 heterozygous loci that encode proteins with functional variation due to natural amino acid polymorphism.

MeSH terms

  • Amino Acids / chemistry
  • Amino Acids / genetics*
  • Analysis of Variance
  • Artificial Intelligence
  • Bacterial Proteins / genetics
  • Bacterial Proteins / metabolism
  • Computer Simulation
  • Databases, Factual
  • Escherichia coli Proteins*
  • Forecasting
  • Genome, Human*
  • Humans
  • Lac Repressors
  • Likelihood Functions
  • Mathematical Computing
  • Models, Molecular
  • Models, Statistical*
  • Muramidase / genetics
  • Muramidase / metabolism
  • Mutation
  • Phylogeny
  • Polymorphism, Single Nucleotide*
  • Protein O-Methyltransferase
  • Proteins / chemistry
  • Proteins / metabolism*
  • Repressor Proteins / genetics
  • Repressor Proteins / metabolism
  • Sequence Analysis / methods*

Substances

  • Amino Acids
  • Bacterial Proteins
  • Escherichia coli Proteins
  • Lac Repressors
  • Proteins
  • Repressor Proteins
  • Protein O-Methyltransferase
  • Muramidase