A four-step strategy for handling missing outcome data in randomised trials affected by a pandemic

Suzie Cro; Tim P Morris; Brennan C Kahan; Victoria R Cornelius; James R Carpenter

doi:10.1186/s12874-020-01089-6

A four-step strategy for handling missing outcome data in randomised trials affected by a pandemic

BMC Med Res Methodol. 2020 Aug 12;20(1):208. doi: 10.1186/s12874-020-01089-6.

Authors

Suzie Cro¹, Tim P Morris^{2

3}, Brennan C Kahan², Victoria R Cornelius⁴, James R Carpenter^{2

3}

Affiliations

¹ Imperial Clinical Trials Unit, Imperial College London, Stadium House, 68 Wood Lane, London, UK. s.cro@imperial.ac.uk.
² Department of Medical Statistics, London School of Hygiene and Tropical Medicine, London, UK.
³ MRC Clinical Trials Unit at UCL, 90 High Holborn, London, UK.
⁴ Imperial Clinical Trials Unit, Imperial College London, Stadium House, 68 Wood Lane, London, UK.

Abstract

Background: The coronavirus pandemic (Covid-19) presents a variety of challenges for ongoing clinical trials, including an inevitably higher rate of missing outcome data, with new and non-standard reasons for missingness. International drug trial guidelines recommend trialists review plans for handling missing data in the conduct and statistical analysis, but clear recommendations are lacking.

Methods: We present a four-step strategy for handling missing outcome data in the analysis of randomised trials that are ongoing during a pandemic. We consider handling missing data arising due to (i) participant infection, (ii) treatment disruptions and (iii) loss to follow-up. We consider both settings where treatment effects for a 'pandemic-free world' and 'world including a pandemic' are of interest.

Results: In any trial, investigators should; (1) Clarify the treatment estimand of interest with respect to the occurrence of the pandemic; (2) Establish what data are missing for the chosen estimand; (3) Perform primary analysis under the most plausible missing data assumptions followed by; (4) Sensitivity analysis under alternative plausible assumptions. To obtain an estimate of the treatment effect in a 'pandemic-free world', participant data that are clinically affected by the pandemic (directly due to infection or indirectly via treatment disruptions) are not relevant and can be set to missing. For primary analysis, a missing-at-random assumption that conditions on all observed data that are expected to be associated with both the outcome and missingness may be most plausible. For the treatment effect in the 'world including a pandemic', all participant data is relevant and should be included in the analysis. For primary analysis, a missing-at-random assumption - potentially incorporating a pandemic time-period indicator and participant infection status - or a missing-not-at-random assumption with a poorer response may be most relevant, depending on the setting. In all scenarios, sensitivity analysis under credible missing-not-at-random assumptions should be used to evaluate the robustness of results. We highlight controlled multiple imputation as an accessible tool for conducting sensitivity analyses.

Conclusions: Missing data problems will be exacerbated for trials active during the Covid-19 pandemic. This four-step strategy will facilitate clear thinking about the appropriate analysis for relevant questions of interest.

Keywords: Controlled multiple imputation; Coronavirus SARS-CoV-2; Covid-19; Estimands; Missing data; Pandemic; Randomised trials; Sensitivity analysis.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Betacoronavirus / physiology
COVID-19
Comorbidity
Coronavirus Infections / epidemiology
Coronavirus Infections / therapy
Coronavirus Infections / virology
Humans
Outcome Assessment, Health Care / methods
Outcome Assessment, Health Care / statistics & numerical data*
Pandemics
Pneumonia, Viral / epidemiology
Pneumonia, Viral / therapy
Pneumonia, Viral / virology
Practice Guidelines as Topic*
Randomized Controlled Trials as Topic / methods
Randomized Controlled Trials as Topic / statistics & numerical data*
Reproducibility of Results
Research Design / statistics & numerical data*
SARS-CoV-2

Abstract

Publication types

MeSH terms

Grants and funding