Adjusting for partially missing baseline measurements in randomized trials

Stat Med. 2005 Apr 15;24(7):993-1007. doi: 10.1002/sim.1981.


Adjustment for baseline variables in a randomized trial can increase power to detect a treatment effect. However, when baseline data are partly missing, analysis of complete cases is inefficient. We consider various possible improvements in the case of normally distributed baseline and outcome variables. Joint modelling of baseline and outcome is the most efficient method. Mean imputation is an excellent alternative, subject to three conditions. Firstly, if baseline and outcome are correlated more than about 0.6 then weighting should be used to allow for the greater information from complete cases. Secondly, imputation should be carried out in a deterministic way, using other baseline variables if possible, but not using randomized arm or outcome. Thirdly, if baselines are not missing completely at random, then a dummy variable for missingness should be included as a covariate (the missing indicator method). The methods are illustrated in a randomized trial in community psychiatry.

MeSH terms

  • Data Interpretation, Statistical*
  • Humans
  • Models, Statistical*
  • Patient Satisfaction
  • Psychotic Disorders / therapy
  • Randomized Controlled Trials as Topic / methods*
  • Sample Size