Matching Weights to Simultaneously Compare Three Treatment Groups: Comparison to Three-way Matching

Kazuki Yoshida; Sonia Hernández-Díaz; Daniel H Solomon; John W Jackson; Joshua J Gagne; Robert J Glynn; Jessica M Franklin

doi:10.1097/EDE.0000000000000627

Matching Weights to Simultaneously Compare Three Treatment Groups: Comparison to Three-way Matching

Epidemiology. 2017 May;28(3):387-395. doi: 10.1097/EDE.0000000000000627.

Authors

Kazuki Yoshida¹, Sonia Hernández-Díaz, Daniel H Solomon, John W Jackson, Joshua J Gagne, Robert J Glynn, Jessica M Franklin

Affiliation

¹ From the aDepartment of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA; bDepartment of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA; cDivision of Rheumatology, Immunology and Allergy, Brigham and Women's Hospital, Boston, MA; dDivision of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA; and eDepartment of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD.

Abstract

Background: Propensity score matching is a commonly used tool. However, its use in settings with more than two treatment groups has been less frequent. We examined the performance of a recently developed propensity score weighting method in the three-treatment group setting.

Methods: The matching weight method is an extension of inverse probability of treatment weighting (IPTW) that reweights both exposed and unexposed groups to emulate a propensity score matched population. Matching weights can generalize to multiple treatment groups. The performance of matching weights in the three-group setting was compared via simulation to three-way 1:1:1 propensity score matching and IPTW. We also applied these methods to an empirical example that compared the safety of three analgesics.

Results: Matching weights had similar bias, but better mean squared error (MSE) compared with three-way matching in all scenarios. The benefits were more pronounced in scenarios with a rare outcome, unequally sized treatment groups, or poor covariate overlap. IPTW's performance was highly dependent on covariate overlap. In the empirical example, matching weights achieved the best balance for 24 out of 35 covariates. Hazard ratios were numerically similar to matching. However, the confidence intervals were narrower for matching weights.

Conclusions: Matching weights demonstrated improved performance over three-way matching in terms of MSE, particularly in simulation scenarios where finding matched subjects was difficult. Given its natural extension to settings with even more than three groups, we recommend matching weights for comparing outcomes across multiple treatment groups, particularly in settings with rare outcomes or unequal exposure distributions. See video abstract at, http://links.lww.com/EDE/B188.

Publication types

Video-Audio Media
Research Support, Non-U.S. Gov't
Research Support, N.I.H., Extramural

MeSH terms

Adult
Analgesics / adverse effects
Analgesics, Opioid / adverse effects*
Anti-Inflammatory Agents, Non-Steroidal / adverse effects*
Cardiovascular Diseases / chemically induced*
Cyclooxygenase Inhibitors / adverse effects*
Epidemiologic Methods
Female
Fractures, Bone / chemically induced*
Gastrointestinal Hemorrhage / chemically induced*
Humans
Male
Middle Aged
Propensity Score
Proportional Hazards Models

Substances

Analgesics
Analgesics, Opioid
Anti-Inflammatory Agents, Non-Steroidal
Cyclooxygenase Inhibitors

Grants and funding

K24 AR055989/AR/NIAMS NIH HHS/United States