Accounting for misclassified outcomes in binary regression models using multiple imputation with internal validation data

Am J Epidemiol. 2013 May 1;177(9):904-12. doi: 10.1093/aje/kws340. Epub 2013 Apr 4.

Abstract

Outcome misclassification is widespread in epidemiology, but methods to account for it are rarely used. We describe the use of multiple imputation to reduce bias when validation data are available for a subgroup of study participants. This approach is illustrated using data from 308 participants in the multicenter Herpetic Eye Disease Study between 1992 and 1998 (48% female; 85% white; median age, 49 years). The odds ratio comparing the acyclovir group with the placebo group on the gold-standard outcome (physician-diagnosed herpes simplex virus recurrence) was 0.62 (95% confidence interval (CI): 0.35, 1.09). We masked ourselves to physician diagnosis except for a 30% validation subgroup used to compare methods. Multiple imputation (odds ratio (OR) = 0.60; 95% CI: 0.24, 1.51) was compared with naive analysis using self-reported outcomes (OR = 0.90; 95% CI: 0.47, 1.73), analysis restricted to the validation subgroup (OR = 0.57; 95% CI: 0.20, 1.59), and direct maximum likelihood (OR = 0.62; 95% CI: 0.26, 1.53). In simulations, multiple imputation and direct maximum likelihood had greater statistical power than did analysis restricted to the validation subgroup, yet all 3 provided unbiased estimates of the odds ratio. The multiple-imputation approach was extended to estimate risk ratios using log-binomial regression. Multiple imputation has advantages regarding flexibility and ease of implementation for epidemiologists familiar with missing data methods.

Keywords: Monte Carlo method; bias(epidemiology); logistic regression; sensitivity and specificity.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Acyclovir / therapeutic use*
  • Antiviral Agents / therapeutic use
  • Bias*
  • Computer Simulation
  • Data Interpretation, Statistical
  • Eye Diseases / drug therapy*
  • Eye Diseases / virology
  • Female
  • Herpes Simplex / drug therapy*
  • Humans
  • Logistic Models
  • Male
  • Middle Aged
  • Monte Carlo Method
  • Multicenter Studies as Topic
  • Odds Ratio
  • Outcome Assessment, Health Care / methods
  • Outcome Assessment, Health Care / standards*
  • Outcome Assessment, Health Care / statistics & numerical data
  • Regression Analysis
  • Reproducibility of Results
  • Research Design
  • Secondary Prevention

Substances

  • Antiviral Agents
  • Acyclovir