New AI explained and validated deep learning approaches to accurately predict diabetes

Med Biol Eng Comput. 2025 Aug;63(8):2373-2392. doi: 10.1007/s11517-025-03338-6. Epub 2025 Mar 4.

Abstract

Diabetes is a metabolic condition that can lead to chronic illness and organ failure if it remains untreated. Accurate detection is essential to reduce these risks at an early stage. Recent advancements in predictive models show promising results. However, these models exhibit inadequate accuracy, struggle with class imbalance, and lack interpretability of the decision-making process. To overcome these issues, we propose two novel deep models for early and accurate diabetes prediction: LeDNet (inspired by LeNet and the Dual Attention Network) and HiDenNet (influenced by the highway network and DenseNet). The models are trained using the Diabetes Health Indicators dataset, which has an inherent class imbalance problem and results in biased predictions. This imbalance is mitigated by employing the majority-weighted minority over-sampling technique. Experimental findings demonstrate that LeDNet achieves an F1-score of 85%, recall of 84%, accuracy of 85%, and precision of 86%. Similarly, HiDenNet achieves accuracy, F1-score, recall, and precision of 85%, 86%, 86%, and 86%, respectively. Both proposed models outperform the state-of-the-art deep learning (DL) models. K-fold cross-validation is applied to ensure models' stability at different data splits. Local interpretable model-agnostic explanations and Shapley additive explanations techniques are utilized to enhance interpretability and overcome the traditional black-box nature of DL models. By providing both local and global insights into feature contributions, these explainable artificial intelligence techniques provide transparency to LeDNet and HiDenNet in diabetes prediction. LeDNet and HiDenNet not only improve decision-making transparency but also enhance diabetes prediction accuracy, making them reliable tools for clinical decision-making and early diagnosis.

Keywords: Deep learning; Dense network; Diabetes prediction; Dual attention network; Explainable artificial intelligence; Majority weighted minority over-sampling technique.

MeSH terms

  • Algorithms
  • Artificial Intelligence*
  • Deep Learning*
  • Diabetes Mellitus* / diagnosis
  • Humans