J Am Coll Radiol. 2019 Mar;16(3):336-343.
doi: 10.1016/j.jacr.2018.10.020. Epub 2018 Dec 29.

Use of Machine Learning to Identify Follow-Up Recommendations in Radiology Reports

Emmanuel Carrodeguas et al. J Am Coll Radiol. 2019 Mar.

Abstract

Purpose: The aims of this study were to assess follow-up recommendations in radiology reports, develop and assess traditional machine learning (TML) and deep learning (DL) models in identifying follow-up, and benchmark them against a natural language processing (NLP) system.

Methods: This HIPAA-compliant, institutional review board-approved study was performed at an academic medical center generating >500,000 radiology reports annually. One thousand randomly selected ultrasound, radiography, CT, and MRI reports generated in 2016 were manually reviewed and annotated for follow-up recommendations. TML (support vector machines, random forest, logistic regression) and DL (recurrent neural nets) algorithms were constructed and trained on 850 reports (training data), with subsequent optimization of model architectures and parameters. Precision, recall, and F1 score were calculated on the remaining 150 reports (test data). A previously developed and validated NLP system (iSCOUT) was also applied to the test data, with equivalent metrics calculated.
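The three evaluation metrics named above can be illustrated with a minimal sketch. It assumes binary labels (1 = report contains a follow-up recommendation, 0 = it does not). The function name `precision_recall_f1` and the toy labels are hypothetical and are not taken from the study.

```python
from typing import Sequence, Tuple


def precision_recall_f1(
    y_true: Sequence[int], y_pred: Sequence[int]
) -> Tuple[float, float, float]:
    """Compute precision, recall, and F1 for the positive class (label 1)."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")

    # Count true positives, false positives, and false negatives.
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)

    # Guard against division by zero when a class is never predicted/present.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    # F1 is the harmonic mean of precision and recall.
    f1 = (
        2 * precision * recall / (precision + recall)
        if (precision + recall)
        else 0.0
    )
    return precision, recall, f1


# Toy example (hypothetical labels, not study data).
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 1, 0, 1, 0]
precision, recall, f1 = precision_recall_f1(y_true, y_pred)
print(f"precision={precision:.2f} recall={recall:.2f} f1={f1:.2f}")
```

Because follow-up recommendations are a minority class, F1 on the positive class is more informative than plain accuracy, which a classifier could inflate by always predicting "no follow-up".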

Results: Follow-up recommendations were present in 12.7% of reports. The TML algorithms achieved F1 scores of 0.75 (random forest), 0.83 (logistic regression), and 0.85 (support vector machine) on the test data. DL recurrent neural nets had an F1 score of 0.71; iSCOUT also had an F1 score of 0.71. Performance of both TML and DL methods by F1 scores appeared to plateau after 500 to 700 samples while training.

Conclusions: TML and DL are feasible methods to identify follow-up recommendations. These methods have great potential for near real-time monitoring of follow-up recommendations in radiology reports.

Keywords: Machine learning; deep learning; follow-up recommendations; natural language processing; radiology report.

Figures

Figure 1:
(A-C) Receiver operating characteristic (ROC) curves for traditional machine learning models (blue) and optimized parameters (green). Subplots represent support vector machines (A), random forest (B), and logistic regression (C). (D) ROC curve for the top long short-term memory deep learning architecture (100 nodes × 2 layers).
Figure 2:
Average F1 scores for deep learning (DL, blue) and traditional machine learning (TML, orange) models with increasing training data (100 to 750 samples).
