An application of propensity score matching using claims data

Pharmacoepidemiol Drug Saf. 2005 Jul;14(7):465-76. doi: 10.1002/pds.1062.


Background: Confounding by indication is a common problem in pharmacoepidemiology, where predictors of treatment also have prognostic value for the outcome of interest. The tools available to the epidemiologist that can be used to mitigate the effects of confounding by indication often have limits with respect to the number of variables that can be simultaneously incorporated as components of the confounding. This constraint becomes particularly apparent in the context of a rich data source (such as administrative claims data), applied to the study of an outcome that occurs infrequently. In such settings, there will typically be many more variables available for control as potential confounders than traditional epidemiologic techniques will allow.

Methods: One tool that can indirectly permit control of a large number of variables is the propensity score approach. This paper illustrates the application of the propensity score to a study conducted in an administrative database, and raises critical issues to be addressed in such an analysis. In this example, the effect of statin therapy on the occurrence of myocardial infarction was examined, and numerous potential confounders of this association were adjusted simultaneously using a propensity score to form matched cohorts of statin initiators and non-initiators.

Results: The incidence of myocardial infarction observed in the statin treated cohort was lower than the incidence in the untreated cohort, and the magnitude of this effect was consistent with results from randomized placebo controlled clinical trials of statin therapy.

Conclusions: This example illustrates how confounding by indication can be mitigated by the propensity score matching technique. Concerns remain over the generalizability of estimates obtained from such a study, and how to know when propensity scores are removing bias, since apparent balance between compared groups on measured variables could leave variables not included in the propensity score unbalanced and lead to confounded effect estimates.

Publication types

  • Comparative Study

MeSH terms

  • Cohort Studies
  • Confounding Factors, Epidemiologic
  • Data Collection / methods
  • Humans
  • Hydroxymethylglutaryl-CoA Reductase Inhibitors / therapeutic use*
  • Hypercholesterolemia / drug therapy*
  • Hypercholesterolemia / epidemiology
  • Incidence
  • Myocardial Infarction / epidemiology
  • Myocardial Infarction / prevention & control*
  • Outcome Assessment, Health Care
  • Pharmaceutical Services / statistics & numerical data*
  • Pharmacoepidemiology / methods*
  • Randomized Controlled Trials as Topic
  • Risk Factors
  • Statistics as Topic


  • Hydroxymethylglutaryl-CoA Reductase Inhibitors