Curating gene variant databases (LSDBs): toward a universal standard

Hum Mutat. 2012 Feb;33(2):291-7. doi: 10.1002/humu.21626. Epub 2011 Nov 3.


Gene variant databases or Locus-Specific DataBases (LSDBs) are used to collect and display information on sequence variants on a gene-by-gene basis. Their most frequent use is in relation to DNA-based diagnostics, giving clinicians and scientists easy access to an up-to-date overview of all gene variants identified worldwide and whether they influence the function of the gene ("pathogenic or not"). While literature on gene variant databases is extensive, little has been published on the process of database curation itself. Based on our extensive experience as LSDB curators and our contributions to database curation courses, we discuss the subject of database curation. We describe the tasks involved, the steps to take, and the issues that might occur. Our overview is a first step toward establishing overall guidelines for database curation and ultimately covers one aspect of establishing quality-assured gene variant databases.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Computational Biology / methods
  • Computational Biology / standards
  • Databases, Genetic / standards*
  • Genes*
  • Genetic Variation*
  • Humans
  • Internet
  • Software