Which supervised machine learning algorithm can best predict achievement of minimum clinically important difference in neck pain after surgery in patients with cervical myelopathy? A QOD study

Christine Park; Praveen V Mummaneni; Oren N Gottfried; Christopher I Shaffrey; Anthony J Tang; Erica F Bisson; Anthony L Asher; Domagoj Coric; Eric A Potts; Kevin T Foley; Michael Y Wang; Kai-Ming Fu; Michael S Virk; John J Knightly; Scott Meyer; Paul Park; Cheerag Upadhyaya; Mark E Shaffrey; Avery L Buchholz; Luis M Tumialán; Jay D Turner; Brandon A Sherrod; Nitin Agarwal; Dean Chou; Regis W Haid; Mohamad Bydon; Andrew K Chan

doi:10.3171/2023.3.FOCUS2372

Which supervised machine learning algorithm can best predict achievement of minimum clinically important difference in neck pain after surgery in patients with cervical myelopathy? A QOD study

Neurosurg Focus. 2023 Jun;54(6):E5. doi: 10.3171/2023.3.FOCUS2372.

Authors

Christine Park¹, Praveen V Mummaneni², Oren N Gottfried¹, Christopher I Shaffrey¹, Anthony J Tang³, Erica F Bisson⁴, Anthony L Asher⁵, Domagoj Coric⁵, Eric A Potts⁶, Kevin T Foley⁷, Michael Y Wang⁸, Kai-Ming Fu⁹, Michael S Virk⁹, John J Knightly¹⁰, Scott Meyer¹⁰, Paul Park¹¹, Cheerag Upadhyaya¹², Mark E Shaffrey¹³, Avery L Buchholz¹³, Luis M Tumialán¹⁴, Jay D Turner¹⁴, Brandon A Sherrod⁴, Nitin Agarwal¹⁵, Dean Chou³, Regis W Haid¹⁶, Mohamad Bydon¹⁷, Andrew K Chan³

Affiliations

¹ 1Department of Neurosurgery, Duke University, Durham, North Carolina.
² 2Department of Neurosurgery, University of California, San Francisco, California.
³ 3Department of Neurological Surgery, Columbia University Vagelos College of Physicians and Surgeons, The Och Spine Hospital at NewYork-Presbyterian, New York, New York.
⁴ 4Department of Neurosurgery, University of Utah, Salt Lake City, Utah.
⁵ 5Neuroscience Institute, Carolinas Healthcare System and Carolina Neurosurgery & Spine Associates, Charlotte, North Carolina.
⁶ 6Goodman Campbell Brain and Spine, Indianapolis, Indiana.
⁷ 7Department of Neurosurgery, University of Tennessee, Semmes-Murphey Neurologic and Spine Institute, Memphis, Tennessee.
⁸ 8Department of Neurosurgery, University of Miami, Florida.
⁹ 9Department of Neurosurgery, Weill Cornell Medical Center, New York, New York.
¹⁰ 10Atlantic Neurosurgical Specialists, Morristown, New Jersey.
¹¹ 11Department of Neurosurgery, University of Michigan, Ann Arbor, Michigan.
¹² 12Marion Bloch Neuroscience Institute, Saint Luke's Health System, Kansas City, Missouri.
¹³ 13Department of Neurosurgery, University of Virginia, Charlottesville, Virginia.
¹⁴ 14Barrow Neurological Institute, Phoenix, Arizona.
¹⁵ 15Department of Neurosurgery, Washington University in St. Louis, Missouri.
¹⁶ 16Atlanta Brain and Spine Care, Atlanta, Georgia; and.
¹⁷ 17Department of Neurologic Surgery, Mayo Clinic, Rochester, Minnesota.

PMID: 37283449
DOI: 10.3171/2023.3.FOCUS2372

Abstract

Objective: The purpose of this study was to evaluate the performance of different supervised machine learning algorithms to predict achievement of minimum clinically important difference (MCID) in neck pain after surgery in patients with cervical spondylotic myelopathy (CSM).

Methods: This was a retrospective analysis of the prospective Quality Outcomes Database CSM cohort. The data set was divided into an 80% training and a 20% test set. Various supervised learning algorithms (including logistic regression, support vector machine, decision tree, random forest, extra trees, gaussian naïve Bayes, k-nearest neighbors, multilayer perceptron, and extreme gradient boosted trees) were evaluated on their performance to predict achievement of MCID in neck pain at 3 and 24 months after surgery, given a set of predicting baseline features. Model performance was assessed with accuracy, F1 score, area under the receiver operating characteristic curve, precision, recall/sensitivity, and specificity.

Results: In total, 535 patients (46.9%) achieved MCID for neck pain at 3 months and 569 patients (49.9%) achieved it at 24 months. In each follow-up cohort, 501 patients (93.6%) were satisfied at 3 months after surgery and 569 patients (100%) were satisfied at 24 months after surgery. Of the supervised machine learning algorithms tested, logistic regression demonstrated the best accuracy (3 months: 0.76 ± 0.031, 24 months: 0.773 ± 0.044), followed by F1 score (3 months: 0.759 ± 0.019, 24 months: 0.777 ± 0.039) and area under the receiver operating characteristic curve (3 months: 0.762 ± 0.027, 24 months: 0.773 ± 0.043) at predicting achievement of MCID for neck pain at both follow-up time points, with fair performance. The best precision was also demonstrated by logistic regression at 3 (0.724 ± 0.058) and 24 (0.780 ± 0.097) months. The best recall/sensitivity was demonstrated by multilayer perceptron at 3 months (0.841 ± 0.094) and by extra trees at 24 months (0.817 ± 0.115). Highest specificity was shown by support vector machine at 3 months (0.952 ± 0.013) and by logistic regression at 24 months (0.747 ± 0.18).

Conclusions: Appropriate selection of models for studies should be based on the strengths of each model and the aims of the studies. For maximally predicting true achievement of MCID in neck pain, of all the predictions in this balanced data set the appropriate metric for the authors' study was precision. For both short- and long-term follow-ups, logistic regression demonstrated the highest precision of all models tested. Logistic regression performed consistently the best of all models tested and remains a powerful model for clinical classification tasks.

Keywords: Quality Outcomes Database; cervical spondylotic myelopathy; machine learning; neck pain; patient satisfaction; patient-reported outcomes.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Algorithms
Bayes Theorem
Humans
Neck Pain* / diagnosis
Neck Pain* / surgery
Prospective Studies
Retrospective Studies
Spinal Cord Diseases* / surgery
Supervised Machine Learning