VariSNP, a benchmark database for variations from dbSNP

Hum Mutat. 2015 Feb;36(2):161-6. doi: 10.1002/humu.22727. Epub 2015 Jan 8.


For development and evaluation of methods for predicting the effects of variations, benchmark datasets are needed. Some previously developed datasets are available for this purpose, but newer and larger benchmark sets for benign variants have largely been missing. VariSNP datasets are selected from dbSNP. These subsets were filtered against disease-related variants in the ClinVar, UniProtKB/Swiss-Prot, and PhenCode databases, to identify neutral or nonpathogenic cases. All variant descriptions include mapping to reference sequences on chromosomal, genomic, coding DNA, and protein levels. The datasets will be updated with automated scripts on a regular basis and are freely available at

Keywords: benchmark; dbSNP; genetic variation; mutation; variant effect analysis; variant effect prediction; variant position mapping.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Databases, Genetic*
  • Genome, Human
  • Humans
  • Models, Genetic
  • Polymorphism, Single Nucleotide*