Objective: To develop machine learning (ML) models that predict severity of head collision events (HCEs) based on preinjury variables and to investigate which variables are important to predicting severity.
Design: Data on HCEs were collected with respect to severity and 23 preinjury variables to create 2 datasets, a male dataset using men's tournaments and mixed dataset using men's and women's tournaments, to perform ML analysis. Machine learning analysis used a random forest classifier based on preinjury variables to predict HCE severity.
Setting: Four elite international soccer tournaments.
Participants: Elite athletes participating in analyzed tournaments.
Independent variables: The 23 preinjury variables collected for each HCE.
Main outcome measures: Predictive ability of the ML models and association of important variables.
Results: The ML models had an average area under the receiver operating characteristic curve for predicting HCE severity of 0.73 and 0.70 for the male and mixed datasets, respectively. The most important variables for prediction were the mechanism of injury and the event before injury. In the male dataset, the mechanisms "head-to-head" and "knee-to-head" were together significantly associated ( P = 0.0244) with severity; they were not significant in the mixed dataset ( P = 0.1113). In both datasets, the events "corner kicks" and "throw-ins" were together significantly associated with severity (male, P = 0.0001; mixed, P = 0.0004).
Conclusions: ML models accurately predicted the severity of HCE. The mechanism and event preceding injury were most important for predicting severity of HCEs. These findings support the use of ML to inform preventative measures that will mitigate the impact of these preinjury factors on player health.
Copyright © 2022 The Author(s). Published by Wolters Kluwer Health, Inc.