A Preliminary Study of Extracting Pulmonary Nodules and Nodule Characteristics from Radiology Reports Using Natural Language Processing

IEEE Int Conf Healthc Inform. 2022 Jun:2022:618-619. doi: 10.1109/ichi54592.2022.00125. Epub 2022 Sep 8.

Abstract

This study aims to develop a natural language processing (NLP) tool to extract the pulmonary nodules and nodule characteristics information from free-text clinical narratives. We identified a cohort of 3,080 patients who received low dose computed tomography (LDCT) at the University of Florida health system and collected their clinical narratives including radiology reports in their electronic health records (EHRs). Then, we manually annotated 394 reports as the gold-standard corpus and explored three state-of-the-art transformer-based NLP methods. The best model achieved an F1-score of 0.9279.

Keywords: deep learning; natural language processing; nodule characteristics; pulmonary nodule.