Machine Learning Approaches for Early Prostate Cancer Prediction Based on Healthcare Utilization Patterns

Stud Health Technol Inform. 2022 Jan 14:289:65-68. doi: 10.3233/SHTI210860.

Abstract

The goal of this study was to build a machine learning model for early prostate cancer prediction based on healthcare utilization patterns. We examined the frequency and pattern changes of healthcare utilization in 2916 prostate cancer patients 3 years prior to their prostate cancer diagnoses and explored several supervised machine learning techniques to predict possible prostate cancer diagnosis. Analysis of patients' medical activities between 1 year and 2 years prior to their prostate cancer diagnoses using XGBoost model provided the best prediction accuracy with high F1 score (0.9) and AUC score (0.73). These pilot results indicated that application of machine learning to healthcare utilization patterns may result in early identification of prostate cancer diagnosis.

Keywords: Big Data Analytics; Machine Learning; Prostate Cancer.

MeSH terms

  • Humans
  • Machine Learning*
  • Male
  • Patient Acceptance of Health Care
  • Prostatic Neoplasms* / diagnosis
  • Prostatic Neoplasms* / epidemiology
  • Prostatic Neoplasms* / therapy
  • Supervised Machine Learning