Using Natural Language Processing to Extract Abnormal Results From Cancer Screening Reports

Carlton R Moore; Ashraf Farrag; Evan Ashkin

doi:10.1097/PTS.0000000000000127

Using Natural Language Processing to Extract Abnormal Results From Cancer Screening Reports

J Patient Saf. 2017 Sep;13(3):138-143. doi: 10.1097/PTS.0000000000000127.

Authors

Carlton R Moore¹, Ashraf Farrag, Evan Ashkin

Affiliation

¹ From the *Division of General Medicine and Clinical Epidemiology, Department of Medicine, School of Medicine, †The North Carolina Translational and Clinical Sciences Center, and ‡Department of family Medicine, School of Medicine, University of North Carolina, Chapel Hill, North Carolina.

Abstract

Objectives: Numerous studies show that follow-up of abnormal cancer screening results, such as mammography and Papanicolaou (Pap) smears, is frequently not performed in a timely manner. A contributing factor is that abnormal results may go unrecognized because they are buried in free-text documents in electronic medical records (EMRs), and, as a result, patients are lost to follow-up. By identifying abnormal results from free-text reports in EMRs and generating alerts to clinicians, natural language processing (NLP) technology has the potential for improving patient care. The goal of the current study was to evaluate the performance of NLP software for extracting abnormal results from free-text mammography and Pap smear reports stored in an EMR.

Methods: A sample of 421 and 500 free-text mammography and Pap reports, respectively, were manually reviewed by a physician, and the results were categorized for each report. We tested the performance of NLP to extract results from the reports. The 2 assessments (criterion standard versus NLP) were compared to determine the precision, recall, and accuracy of NLP.

Results: When NLP was compared with manual review for mammography reports, the results were as follows: precision, 98% (96%-99%); recall, 100% (98%-100%); and accuracy, 98% (96%-99%). For Pap smear reports, the precision, recall, and accuracy of NLP were all 100%.

Conclusions: Our study developed NLP models that accurately extract abnormal results from mammography and Pap smear reports. Plans include using NLP technology to generate real-time alerts and reminders for providers to facilitate timely follow-up of abnormal results.

MeSH terms

Adult
Early Detection of Cancer / methods*
Female
Humans
Mass Screening
Middle Aged
Natural Language Processing*
Neoplasms / diagnosis*
Young Adult

Grants and funding

UL1 TR001111/TR/NCATS NIH HHS/United States