Prospective validation of a machine learning model that uses provider notes to identify candidates for resective epilepsy surgery

Epilepsia. 2020 Jan;61(1):39-48. doi: 10.1111/epi.16398. Epub 2019 Nov 29.

Abstract

Objective: Delay to resective epilepsy surgery results in avoidable disease burden and increased risk of mortality. The objective was to prospectively validate a natural language processing (NLP) application that uses provider notes to assign epilepsy surgery candidacy scores.

Methods: The application was trained on notes from (1) patients with a diagnosis of epilepsy and a history of resective epilepsy surgery and (2) patients who were seizure-free without surgery. The testing set included all patients with unknown surgical candidacy status and an upcoming neurology visit. Training and testing sets were updated weekly for 1 year. One- to three-word phrases contained in patients' notes were used as features. Patients prospectively identified by the application as candidates for surgery were manually reviewed by two epileptologists. Performance metrics were defined by comparing NLP-derived surgical candidacy scores with surgical candidacy status from expert chart review.

Results: The training set was updated weekly and included notes from a mean of 519 ± 67 patients. The area under the receiver operating characteristic curve (AUC) from 10-fold cross-validation was 0.90 ± 0.04 (range = 0.83-0.96) and improved by 0.002 per week (P < .001) as new patients were added to the training set. Of the 6395 patients who visited the neurology clinic, 4211 (67%) were evaluated by the model. The prospective AUC on this test set was 0.79 (95% confidence interval [CI] = 0.62-0.96). Using the optimal surgical candidacy score threshold, sensitivity was 0.80 (95% CI = 0.29-0.99), specificity was 0.77 (95% CI = 0.64-0.88), positive predictive value was 0.25 (95% CI = 0.07-0.52), and negative predictive value was 0.98 (95% CI = 0.87-1.00). The number needed to screen was 5.6.

Significance: An electronic health record-integrated NLP application can accurately assign surgical candidacy scores to patients in a clinical setting.

Keywords: clinical decision support; epilepsy surgery; machine learning; natural language processing.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, P.H.S.
  • Validation Study

MeSH terms

  • Adolescent
  • Adult
  • Child
  • Child, Preschool
  • Decision Support Systems, Clinical
  • Electronic Health Records*
  • Epilepsy / surgery*
  • Female
  • Humans
  • Infant
  • Infant, Newborn
  • Machine Learning*
  • Male
  • Middle Aged
  • Natural Language Processing*
  • Patient Selection*
  • Prospective Studies
  • Young Adult