Generalising uncertainty improves accuracy and safety of deep learning analytics applied to oncology

Samual MacDonald; Helena Foley; Melvyn Yap; Rebecca L Johnston; Kaiah Steven; Lambros T Koufariotis; Sowmya Sharma; Scott Wood; Venkateswar Addala; John V Pearson; Fred Roosta; Nicola Waddell; Olga Kondrashova; Maciej Trzaskowski

doi:10.1038/s41598-023-31126-5

Generalising uncertainty improves accuracy and safety of deep learning analytics applied to oncology

Sci Rep. 2023 May 6;13(1):7395. doi: 10.1038/s41598-023-31126-5.

Authors

Samual MacDonald^{1

2

3}, Helena Foley¹, Melvyn Yap¹, Rebecca L Johnston⁴, Kaiah Steven¹, Lambros T Koufariotis⁴, Sowmya Sharma^{4

5}, Scott Wood⁴, Venkateswar Addala⁴, John V Pearson⁴, Fred Roosta^{2

3}, Nicola Waddell⁴, Olga Kondrashova⁶, Maciej Trzaskowski^{7

8

9

10}

Affiliations

¹ Max Kelsen, Brisbane, QLD, Australia.
² ARC Training Centre for Information Resilience (CIRES), Brisbane, Australia.
³ The University of Queensland, Brisbane, Australia.
⁴ QIMR Berghofer Medical Research Institute, Brisbane, QLD, Australia.
⁵ ACL Pathology, Bella Vista, NSW, Australia.
⁶ QIMR Berghofer Medical Research Institute, Brisbane, QLD, Australia. olga.kondrashova@qimrberghofer.edu.au.
⁷ Max Kelsen, Brisbane, QLD, Australia. m.trzaskowski@uq.edu.au.
⁸ ARC Training Centre for Information Resilience (CIRES), Brisbane, Australia. m.trzaskowski@uq.edu.au.
⁹ The University of Queensland, Brisbane, Australia. m.trzaskowski@uq.edu.au.
¹⁰ QIMR Berghofer Medical Research Institute, Brisbane, QLD, Australia. m.trzaskowski@uq.edu.au.

Abstract

Uncertainty estimation is crucial for understanding the reliability of deep learning (DL) predictions, and critical for deploying DL in the clinic. Differences between training and production datasets can lead to incorrect predictions with underestimated uncertainty. To investigate this pitfall, we benchmarked one pointwise and three approximate Bayesian DL models for predicting cancer of unknown primary, using three RNA-seq datasets with 10,968 samples across 57 cancer types. Our results highlight that simple and scalable Bayesian DL significantly improves the generalisation of uncertainty estimation. Moreover, we designed a prototypical metric-the area between development and production curve (ADP), which evaluates the accuracy loss when deploying models from development to production. Using ADP, we demonstrate that Bayesian DL improves accuracy under data distributional shifts when utilising 'uncertainty thresholding'. In summary, Bayesian DL is a promising approach for generalising uncertainty, improving performance, transparency, and safety of DL models for deployment in the real world.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Bayes Theorem
Deep Learning*
Medical Oncology
Reproducibility of Results
Uncertainty