Analyses of 'change scores' do not estimate causal effects in observational data

Int J Epidemiol. 2022 Oct 13;51(5):1604-1615. doi: 10.1093/ije/dyab050.

Abstract

Background: In longitudinal data, it is common to create 'change scores' by subtracting measurements taken at baseline from those taken at follow-up, and then to analyse the resulting 'change' as the outcome variable. In observational data, this approach can produce misleading causal-effect estimates. The present article uses directed acyclic graphs (DAGs) and simple simulations to provide an accessible explanation for why change scores do not estimate causal effects in observational data.

Methods: Data were simulated to match three general scenarios in which the outcome variable at baseline was a (i) 'competing exposure' (i.e. a cause of the outcome that is neither caused by nor causes the exposure), (ii) confounder or (iii) mediator for the total causal effect of the exposure variable at baseline on the outcome variable at follow-up. Regression coefficients were compared between change-score analyses and the appropriate estimator(s) for the total and/or direct causal effect(s).

Results: Change-score analyses do not provide meaningful causal-effect estimates unless the baseline outcome variable is a 'competing exposure' for the effect of the exposure on the outcome at follow-up. Where the baseline outcome is a confounder or mediator, change-score analyses evaluate obscure estimands, which may diverge substantially in magnitude and direction from the total and direct causal effects.

Conclusion: Future observational studies that seek causal-effect estimates should avoid analysing change scores and adopt alternative analytical strategies.

Keywords: Analysis of change; change scores; change-from-baseline variables; difference scores; directed acyclic graphs; gain scores.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Causality
  • Confounding Factors, Epidemiologic*
  • Humans