Propensity score matching with clustered data. An application to the estimation of the impact of caesarean section on the Apgar score

Stat Med. 2016 May 30;35(12):2074-91. doi: 10.1002/sim.6880. Epub 2016 Feb 1.

Abstract

This article focuses on the implementation of propensity score matching for clustered data. Different approaches to reduce bias due to cluster-level confounders are considered and compared using Monte Carlo simulations. We investigated methods that exploit the clustered structure of the data in two ways: in the estimation of the propensity score model (through the inclusion of fixed or random effects) or in the implementation of the matching algorithm. In addition to a pure within-cluster matching, we also assessed the performance of a new approach, 'preferential' within-cluster matching. This approach first searches for control units to be matched to treated units within the same cluster. If matching is not possible within-cluster, then the algorithm searches in other clusters. All considered approaches successfully reduced the bias due to the omission of a cluster-level confounder. The preferential within-cluster matching approach, combining the advantages of within-cluster and between-cluster matching, showed a relatively good performance both in the presence of big and small clusters, and it was often the best method. An important advantage of this approach is that it reduces the number of unmatched units as compared with a pure within-cluster matching. We applied these methods to the estimation of the effect of caesarean section on the Apgar score using birth register data. Copyright © 2016 John Wiley & Sons, Ltd.

Keywords: Apgar score; caesarean section; clustered data; matching; propensity score; treatment effects.

MeSH terms

  • Adult
  • Algorithms
  • Apgar Score*
  • Cesarean Section / adverse effects*
  • Cesarean Section / statistics & numerical data
  • Cluster Analysis
  • Educational Status
  • Female
  • Gestational Age
  • Humans
  • Maternal Age
  • Models, Statistical
  • Monte Carlo Method
  • Pregnancy
  • Probability
  • Propensity Score*
  • Registries
  • Young Adult