ActiveDriverDB: human disease mutations and genome variation in post-translational modification sites of proteins

Nucleic Acids Res. 2018 Jan 4;46(D1):D901-D910. doi: 10.1093/nar/gkx973.


Interpretation of genetic variation is needed for deciphering genotype-phenotype associations, mechanisms of inherited disease, and cancer driver mutations. Millions of single nucleotide variants (SNVs) in human genomes are known and thousands are associated with disease. An estimated 21% of disease-associated amino acid substitutions corresponding to missense SNVs are located in protein sites of post-translational modifications (PTMs), chemical modifications of amino acids that extend protein function. ActiveDriverDB is a comprehensive human proteo-genomics database that annotates disease mutations and population variants through the lens of PTMs. We integrated >385,000 published PTM sites with ∼3.6 million substitutions from The Cancer Genome Atlas (TCGA), the ClinVar database of disease genes, and human genome sequencing projects. The database includes site-specific interaction networks of proteins, upstream enzymes such as kinases, and drugs targeting these enzymes. We also predicted network-rewiring impact of mutations by analyzing gains and losses of kinase-bound sequence motifs. ActiveDriverDB provides detailed visualization, filtering, browsing and searching options for studying PTM-associated mutations. Users can upload mutation datasets interactively and use our application programming interface in pipelines. Integrative analysis of mutations and PTMs may help decipher molecular mechanisms of phenotypes and disease, as exemplified by case studies of TP53, BRCA2 and VHL. The open-source database is available at
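The motif-based rewiring analysis mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the regular-expression motif (a toy proline-directed pattern), the ±7-residue window, and the function names `mutate` and `motif_impact` are all assumptions chosen for illustration. Real analyses would rely on curated kinase motif models rather than a single regular expression.

```python
import re

# Hypothetical toy motif: a phosphorylated S/T immediately followed by P.
# This stands in for a real kinase motif model, for illustration only.
MOTIF = re.compile(r"[ST]P")
FLANK = 7  # assumed number of flanking residues considered around a PTM site


def mutate(seq, pos, ref, alt):
    """Apply a substitution at 1-based position `pos`, checking the reference residue."""
    if seq[pos - 1] != ref:
        raise ValueError(f"expected {ref} at position {pos}, found {seq[pos - 1]}")
    return seq[:pos - 1] + alt + seq[pos:]


def motif_impact(seq, site, pos, ref, alt):
    """Classify a substitution relative to a PTM site (both positions 1-based).

    Returns 'distal' if the substitution lies outside the flanking window,
    otherwise 'gain', 'loss' or 'unchanged' depending on whether the toy
    motif matches at the PTM site before and after the substitution.
    """
    if abs(pos - site) > FLANK:
        return "distal"
    before = MOTIF.match(seq, site - 1) is not None
    after = MOTIF.match(mutate(seq, pos, ref, alt), site - 1) is not None
    if before and not after:
        return "loss"
    if after and not before:
        return "gain"
    return "unchanged"


# Toy peptide with a hypothetical phosphosite at S5.
print(motif_impact("MKRASPLKAG", 5, 6, "P", "A"))  # loss
print(motif_impact("MKRASALKAG", 5, 6, "A", "P"))  # gain
```

Anchoring the match at the modified residue, rather than searching the whole window, keeps the sketch focused on motifs that involve the PTM site itself.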

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Substitution
  • Data Mining / methods
  • Databases, Genetic*
  • Databases, Protein*
  • Datasets as Topic
  • Disease / genetics*
  • Genetic Association Studies
  • Genetic Variation
  • Genome, Human
  • Genomics
  • Humans
  • Molecular Sequence Annotation
  • Mutation*
  • Polymorphism, Single Nucleotide
  • Protein Kinases / genetics
  • Protein Processing, Post-Translational / genetics*
  • Proteomics
  • Software
  • User-Computer Interface


Substances

  • Protein Kinases