Evaluating performance of risk identification methods through a large-scale simulation of observational data

Patrick B Ryan; Martijn J Schuemie

doi:10.1007/s40264-013-0110-2

Drug Saf. 2013 Oct:36 Suppl 1:S171-80. doi: 10.1007/s40264-013-0110-2.

Authors

Patrick B Ryan¹, Martijn J Schuemie

Affiliation

¹ Janssen Research and Development LLC, 1125 Trenton-Harbourton Road, Room K30205, PO Box 200, Titusville, NJ, 08560, USA, ryan@omop.org.

PMID: 24166233
DOI: 10.1007/s40264-013-0110-2

Abstract

Background: There has been only limited evaluation of statistical methods for identifying safety risks of drug exposure in observational healthcare data. Simulations can support empirical evaluation, but have not been shown to adequately model the real-world phenomena that challenge observational analyses.

Objectives: To design and evaluate a probabilistic framework (OSIM2) for generating simulated observational healthcare data, and to use these data to evaluate how well methods identify associations between drug exposure and health outcomes of interest.

Research design: Seven observational designs, including case-control, cohort, self-controlled case series, and self-controlled cohort designs, were applied to 399 drug-outcome scenarios in 6 simulated datasets: one with no effect and five with injected relative risks of 1.25, 1.5, 2, 4, and 10.
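
To illustrate what "injecting" a relative risk can mean, here is a minimal sketch under strong simplifying assumptions. It is not the OSIM2 procedure: it assumes outcomes occur with a fixed daily probability during unexposed time and that this probability is multiplied by the injected RR during exposed time. The function names, the baseline probability, and the person-day counts are hypothetical choices made only for this example.

```python
import random

# The no-effect dataset (RR = 1) plus the five injected effect sizes from the study design.
INJECTED_RRS = [1.0, 1.25, 1.5, 2.0, 4.0, 10.0]

def simulate_events(person_days, daily_prob, rng):
    """Count outcome events over independent Bernoulli person-days (an assumed toy model)."""
    return sum(1 for _ in range(person_days) if rng.random() < daily_prob)

def estimated_rate_ratio(injected_rr, person_days=200_000, baseline_prob=0.01, seed=0):
    """Simulate unexposed and exposed time, then return the crude rate ratio."""
    rng = random.Random(seed)
    unexposed = simulate_events(person_days, baseline_prob, rng)
    exposed = simulate_events(person_days, baseline_prob * injected_rr, rng)
    # Person-time is equal in both groups, so the ratio of counts is the rate ratio.
    return exposed / unexposed

if __name__ == "__main__":
    for rr in INJECTED_RRS:
        print(f"injected RR = {rr:<5} crude estimate = {estimated_rate_ratio(rr):.2f}")
```

Because this toy setup contains no confounding, the crude estimate should land close to the injected value. Data simulated to resemble real claims data would not be expected to behave so cleanly.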

Subjects: Longitudinal data for 10 million simulated patients were generated using a model derived from an administrative claims database, with associated demographics, periods of drug exposure derived from pharmacy dispensings, and medical conditions derived from diagnoses on medical claims.

Measures: Simulation validation was performed through descriptive comparison with real source data. Method performance was evaluated using Area Under ROC Curve (AUC), bias, and mean squared error.
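
The three performance measures can be sketched as follows. This is an illustration, not the authors' code: it assumes each method produces one effect estimate per drug-outcome scenario, that AUC is computed by ranking those estimates against a binary label (injected effect versus no effect), and that bias and mean squared error are taken on the log relative-risk scale. The function names and example numbers are hypothetical.

```python
import math

def auc(scores, labels):
    """Probability that a random positive scenario outscores a random negative one.

    Ties count as half a win, which matches the area under the ROC curve.
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))

def bias_and_mse(estimated_rrs, true_rr):
    """Mean error and mean squared error of log(estimate) against log(true RR)."""
    errors = [math.log(e) - math.log(true_rr) for e in estimated_rrs]
    bias = sum(errors) / len(errors)
    mse = sum(err * err for err in errors) / len(errors)
    return bias, mse

if __name__ == "__main__":
    # Hypothetical estimates: three no-effect scenarios, three with an injected effect.
    estimates = [0.9, 1.1, 1.3, 1.8, 2.2, 1.2]
    labels = [0, 0, 0, 1, 1, 1]
    print("AUC:", auc(estimates, labels))
    print("bias, MSE:", bias_and_mse(estimates[3:], true_rr=2.0))
```

Working on the log scale is a design choice made here so that over- and underestimation by the same factor count equally. The abstract does not state which scale the study used.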

Results: OSIM2 replicates the prevalence and types of confounding observed in real claims data. When simulated data are injected with relative risks (RR) ≥ 2, all designs have good predictive accuracy (AUC > 0.90), but when RR < 2, no method achieves 100 % correct predictions. Each method exhibits a different bias profile, which changes with the effect size.

Conclusions: OSIM2 can support methodological research. The simulation results suggest that the methods' operating characteristics are far from their nominal properties.

MeSH terms

Adult
Computer Simulation
Databases, Factual
Drug-Related Side Effects and Adverse Reactions / diagnosis*
Female
Humans
Male
Observational Studies as Topic
Research Design*
Risk Assessment / methods*