A time-indexed reference standard of adverse drug reactions

Sci Data. 2014 Nov 11;1:140043. doi: 10.1038/sdata.2014.43.


Undetected adverse drug reactions (ADRs) pose a major burden on the health system. Data mining methodologies designed to identify signals of novel ADRs are of deep importance for drug safety surveillance. The development and evaluation of these methodologies requires proper reference benchmarks. While progress has recently been made in developing such benchmarks, our understanding of the performance characteristics of the data mining methodologies is limited because existing benchmarks do not support prospective performance evaluations. We address this shortcoming by providing a reference standard to support prospective performance evaluations. The reference standard was systematically curated from drug labeling revisions, such as new warnings, which were issued and communicated by the US Food and Drug Administration in 2013. The reference standard includes 62 positive test cases and 75 negative controls, and covers 44 drugs and 38 events. We provide usage guidance and empirical support for the reference standard by applying it to analyze two data sources commonly mined for drug safety surveillance.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Adverse Drug Reaction Reporting Systems / standards*
  • Data Mining
  • Drug Evaluation / standards
  • Drug Labeling / standards
  • Drug-Related Side Effects and Adverse Reactions*
  • Humans
  • Reference Standards
  • Time Factors
  • United States
  • United States Food and Drug Administration

Associated data

  • figshare/10.6084/m9.figshare.1133904