External Validation of a Machine Learning Based Delirium Prediction Software in Clinical Routine

Stud Health Technol Inform. 2022 May 16:293:93-100. doi: 10.3233/SHTI220353.

Abstract

Background: Various machine learning (ML) models have been developed for the prediction of clinical outcomes, but there is missing evidence on their performance in clinical routine and external validation.

Objectives: Our aim was to deploy and prospectively evaluate an already developed delirium prediction software in clinical routine of an external hospital.

Methods: We compared updated ML models of the software and models re-trained with the external hospital's data. The best models were deployed in clinical routine for one month, and risk predictions for all admitted patients were compared to the risk ratings of a senior physician. After using the software, clinicians completed a questionnaire assessing technology acceptance.

Results: Re-trained models achieved a high discriminative performance (AUROC > 0.92). Compared to clinical risk ratings, the software achieved a sensitivity of 100.0% and a specificity of 90.6%. Usefulness, ease of use and output quality were rated positively by the users.

Conclusion: A ML based delirium prediction software achieved a high discriminative performance and high technology acceptance at an external hospital using re-trained ML models.

Keywords: Machine learning; clinical decision support; clinical prediction models; delirium; electronic health records; external validation.

MeSH terms

  • Delirium* / diagnosis
  • Electronic Health Records*
  • Hospitalization
  • Humans
  • Machine Learning
  • Software