Natural language processing improves identification of colorectal cancer testing in the electronic medical record
- PMID: 21393557
- PMCID: PMC9616628
- DOI: 10.1177/0272989X11400418
Abstract
Background: Difficulty identifying patients in need of colorectal cancer (CRC) screening contributes to low screening rates.
Objective: To use Electronic Health Record (EHR) data to identify patients with prior CRC testing.
Design: A clinical natural language processing (NLP) system was modified to identify 4 CRC tests (colonoscopy, flexible sigmoidoscopy, fecal occult blood testing, and double contrast barium enema) within electronic clinical documentation. Text phrases in clinical notes referencing CRC tests were interpreted by the system to determine whether testing was planned or completed and to estimate the date of completed tests.
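As a rough illustration of the kind of interpretation described in the Design, the sketch below uses simple keyword rules to label a sentence's CRC test mentions as planned or completed and to pull out a four-digit year for completed tests. It is not the study's system: the patterns, the cue words, and the names `TEST_PATTERNS`, `PLANNED_CUES`, `Mention`, and `find_mentions` are all hypothetical, and a real clinical NLP system would also need to handle negation, relative dates, and much more varied phrasing.

```python
import re
from dataclasses import dataclass
from typing import List, Optional

# Hypothetical patterns for the four CRC tests named in the abstract.
TEST_PATTERNS = {
    "colonoscopy": r"\bcolonoscopy\b",
    "flexible sigmoidoscopy": r"\b(?:flexible\s+)?sigmoidoscopy\b",
    "fecal occult blood testing": r"\b(?:fobt|fecal occult blood)\b",
    "double contrast barium enema": r"\b(?:dcbe|barium enema)\b",
}

# Hypothetical cue phrases suggesting a test is planned rather than done.
PLANNED_CUES = (
    r"\b(?:will schedule|scheduled for|refer(?:red)? for|"
    r"recommend(?:ed)?|due for|plan(?:ned)?)\b"
)

# A four-digit year serves as a crude estimate of the test date.
YEAR = r"\b(?:19|20)\d{2}\b"


@dataclass
class Mention:
    test: str
    status: str           # "planned" or "completed"
    year: Optional[int]   # only set for completed tests with a year


def find_mentions(sentence: str) -> List[Mention]:
    """Return one Mention per CRC test named in the sentence."""
    text = sentence.lower()
    status = "planned" if re.search(PLANNED_CUES, text) else "completed"
    year_match = re.search(YEAR, text)
    year = int(year_match.group()) if year_match and status == "completed" else None
    return [
        Mention(test, status, year)
        for test, pattern in TEST_PATTERNS.items()
        if re.search(pattern, text)
    ]


if __name__ == "__main__":
    print(find_mentions("Colonoscopy in 2005 was normal."))
    print(find_mentions("Will schedule flexible sigmoidoscopy next month."))
```

Treating every mention without a planning cue as completed is a deliberate simplification to keep the example short.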
Setting: Large academic medical center.
Patients: 200 patients ≥ 50 years old who had completed ≥ 2 non-acute primary care visits within a 1-year period.
Measures: Recall and precision of the NLP system, billing records, and human chart review were compared to a reference standard of human review of all available information sources.
Results: For identification of all CRC tests, recall and precision were as follows: NLP system (recall 93%, precision 94%), chart review (74%, 98%), and billing records review (44%, 83%). Recall and precision for identification of patients in need of screening were: NLP system (recall 95%, precision 88%), chart review (99%, 82%), and billing records (99%, 67%).
Limitations: Small sample size and requirement for a robust EHR.
Conclusions: Applying NLP to EHR records detected more CRC tests than either manual chart review or billing records review alone. For identifying patients who were due for CRC screening, NLP had better precision but marginally lower recall than billing records review.
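For readers unfamiliar with the two measures reported above, recall is the fraction of reference-standard items that a method found, and precision is the fraction of items a method reported that appear in the reference standard. The following minimal sketch computes both from sets of (patient, test) pairs. The function name `precision_recall` and the toy data are assumptions made for illustration and are not taken from the study.

```python
from typing import Hashable, Set, Tuple


def precision_recall(
    predicted: Set[Hashable], reference: Set[Hashable]
) -> Tuple[float, float]:
    """Compute precision and recall of predicted items against a reference set."""
    true_positives = len(predicted & reference)
    # Precision: share of reported items that are in the reference standard.
    precision = true_positives / len(predicted) if predicted else 0.0
    # Recall: share of reference-standard items that were reported.
    recall = true_positives / len(reference) if reference else 0.0
    return precision, recall


if __name__ == "__main__":
    # Toy, made-up (patient, test) pairs; not data from the study.
    reference = {
        ("p1", "colonoscopy"),
        ("p2", "fobt"),
        ("p3", "colonoscopy"),
        ("p4", "sigmoidoscopy"),
    }
    predicted = {
        ("p1", "colonoscopy"),
        ("p2", "fobt"),
        ("p3", "colonoscopy"),
        ("p5", "colonoscopy"),
    }
    precision, recall = precision_recall(predicted, reference)
    print(f"precision={precision:.2f} recall={recall:.2f}")
```

In this toy case three of the four reported pairs are correct and three of the four reference pairs are found, so both measures come out to 0.75.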
Conflict of interest statement
None of the authors have dual commitments or conflicts of interest.
