Prediction of Repeated Self-Harm in Six Months: Comparison of Traditional Psychometrics With Random Forest Algorithm

Omega (Westport). 2024 Mar;88(4):1403-1429. doi: 10.1177/00302228211060596. Epub 2021 Dec 17.

Abstract

Suicidal risk has been a significant mental health problem. However, the predictive ability for repeated self-harm (SH) has not improved over the past decades. This study thus aimed to explore a potential tool with theoretical accommodation and clinical application by employing traditional logistic regression (LR) and newly developed machine learning, random forest algorithm (RF). Starting with 89 items from six commonly used scales (i.e., proximal suicide risk factors) as preliminary predictors, both LR and RF resulted in a better solution with much fewer items in two phases of item selections and analyses, with prediction accuracy 88.6% and 79.8%, respectively. A combination with 12 selected items, named LR-12, well predicted repeated self-harm in 6-month follow-up with satisfactory performance (AUC = 0.84, 95% CI: 0.76-0.92; cut-off point by 1/2 with sensitivity 81.1% and specificity 74.0%). The psychometrically appealing LR-12 could be used as a screening scale for suicide risk assessment.

Keywords: machine learning; random forest; self-harm; self-injury; suicide.

MeSH terms

  • Humans
  • Psychometrics
  • Random Forest*
  • Self-Injurious Behavior* / diagnosis
  • Self-Injurious Behavior* / prevention & control
  • Self-Injurious Behavior* / psychology