Mapping genetic variations to three-dimensional protein structures to enhance variant interpretation: a proposed framework

Genome Med. 2017 Dec 18;9(1):113. doi: 10.1186/s13073-017-0509-y.

Abstract

The translation of personal genomics to precision medicine depends on the accurate interpretation of the multitude of genetic variants observed for each individual. However, even when genetic variants are predicted to modify a protein, their functional implications may be unclear. Many diseases are caused by genetic variants affecting important protein features, such as enzyme active sites or interaction interfaces. The scientific community has catalogued millions of genetic variants in genomic databases and thousands of protein structures in the Protein Data Bank. Mapping mutations onto three-dimensional (3D) structures enables atomic-level analyses of protein positions that may be important for protein stability or for the formation of interactions; these analyses may explain the effect of mutations and in some cases even open a path for targeted drug development. To accelerate progress in the integration of these data types, we held a two-day Gene Variation to 3D (GVto3D) workshop to report on the latest advances and to discuss unmet needs. The overarching goal of the workshop was to address the question: what can be done together as a community to advance the integration of genetic variants and 3D protein structures that could not be done by a single investigator or laboratory? Here we describe the workshop outcomes, review the state of the field, and propose the development of a framework to promote progress in this area. The framework will include a set of standard formats, common ontologies, a common application programming interface to enable interoperation of the resources, and a Tool Registry to make it easy to find tools and apply them to specific analysis problems. Interoperability will enable the integration of diverse data sources and tools and the collaborative development of variant effect prediction methods.

Publication types

  • Review

MeSH terms

  • Algorithms
  • Congresses as Topic
  • Genome-Wide Association Study / methods*
  • Genome-Wide Association Study / standards
  • Humans
  • Polymorphism, Genetic*
  • Protein Conformation*
  • Sequence Analysis, Protein / methods*
  • Sequence Analysis, Protein / standards