Population median imputation was noninferior to complex approaches for imputing missing values in cardiovascular prediction models in clinical practice

J Clin Epidemiol. 2022 May:145:70-80. doi: 10.1016/j.jclinepi.2022.01.011. Epub 2022 Jan 21.

Abstract

Objectives: To compare the validity and robustness of five methods for handling missing characteristics when using cardiovascular disease risk prediction models for individual patients in a real-world clinical setting.

Study design and setting: The performance of the missing data methods was assessed using data from the Swedish National Diabetes Registry (n = 419,533) with external validation using the Scottish Care Information - diabetes database (n = 226,953). Five methods for handling missing data were compared. Two methods using submodels for each combination of available data, two imputation methods: conditional imputation and median imputation, and one alternative modeling method, called the naïve approach, based on hazard ratios and populations statistics of known risk factors only. The validity was compared using calibration plots and c-statistics.

Results: C-statistics were similar across methods in both development and validation data sets, that is, 0.82 (95% CI 0.82-0.83) in the Swedish National Diabetes Registry and 0.74 (95% CI 0.74-0.75) in Scottish Care Information-diabetes database. Differences were only observed after random introduction of missing data in the most important predictor variable (i.e., age).

Conclusion: Validity and robustness of median imputation was not dissimilar to more complex methods for handling missing values, provided that the most important predictor variables, such as age, are not missing.

Keywords: Cardiovascular risk prediction; Epidemiology; Missing patient characteristics; Real-world setting; clinical practise.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Data Collection / methods
  • Databases, Factual
  • Diabetes Mellitus* / epidemiology
  • Humans
  • Proportional Hazards Models
  • Research Design*