Automated extraction of ejection fraction for quality measurement using regular expressions in Unstructured Information Management Architecture (UIMA) for heart failure
- PMID: 22437073
- PMCID: PMC3422820
- DOI: 10.1136/amiajnl-2011-000535
Automated extraction of ejection fraction for quality measurement using regular expressions in Unstructured Information Management Architecture (UIMA) for heart failure
Abstract
Objectives: Left ventricular ejection fraction (EF) is a key component of heart failure quality measures used within the Department of Veteran Affairs (VA). Our goals were to build a natural language processing system to extract the EF from free-text echocardiogram reports to automate measurement reporting and to validate the accuracy of the system using a comparison reference standard developed through human review. This project was a Translational Use Case Project within the VA Consortium for Healthcare Informatics.
Materials and methods: We created a set of regular expressions and rules to capture the EF using a random sample of 765 echocardiograms from seven VA medical centers. The documents were randomly assigned to two sets: a set of 275 used for training and a second set of 490 used for testing and validation. To establish the reference standard, two independent reviewers annotated all documents in both sets; a third reviewer adjudicated disagreements.
Results: System test results for document-level classification of EF of <40% had a sensitivity (recall) of 98.41%, a specificity of 100%, a positive predictive value (precision) of 100%, and an F measure of 99.2%. System test results at the concept level had a sensitivity of 88.9% (95% CI 87.7% to 90.0%), a positive predictive value of 95% (95% CI 94.2% to 95.9%), and an F measure of 91.9% (95% CI 91.2% to 92.7%).
Discussion: An EF value of <40% can be accurately identified in VA echocardiogram reports.
Conclusions: An automated information extraction system can be used to accurately extract EF for quality measurement.
Conflict of interest statement
Figures
Similar articles
-
Unlocking echocardiogram measurements for heart disease research through natural language processing.BMC Cardiovasc Disord. 2017 Jun 12;17(1):151. doi: 10.1186/s12872-017-0580-8. BMC Cardiovasc Disord. 2017. PMID: 28606104 Free PMC article.
-
Development and validation of a heart failure with preserved ejection fraction cohort using electronic medical records.BMC Cardiovasc Disord. 2018 Jun 28;18(1):128. doi: 10.1186/s12872-018-0866-5. BMC Cardiovasc Disord. 2018. PMID: 29954337 Free PMC article.
-
Congestive heart failure information extraction framework for automated treatment performance measures assessment.J Am Med Inform Assoc. 2017 Apr 1;24(e1):e40-e46. doi: 10.1093/jamia/ocw097. J Am Med Inform Assoc. 2017. PMID: 27413122 Free PMC article.
-
Discerning tumor status from unstructured MRI reports--completeness of information in existing reports and utility of automated natural language processing.J Digit Imaging. 2010 Apr;23(2):119-32. doi: 10.1007/s10278-009-9215-7. Epub 2009 May 30. J Digit Imaging. 2010. PMID: 19484309 Free PMC article. Review.
-
Diagnostic Accuracy of Tissue Doppler Index E/e' for Evaluating Left Ventricular Filling Pressure and Diastolic Dysfunction/Heart Failure With Preserved Ejection Fraction: A Systematic Review and Meta-Analysis.J Am Heart Assoc. 2016 Jan 25;5(1):e002530. doi: 10.1161/JAHA.115.002530. J Am Heart Assoc. 2016. PMID: 26811160 Free PMC article. Review.
Cited by
-
Developing a Parsimonious Frailty Index for Older, Multimorbid Adults With Heart Failure Using Machine Learning.Am J Cardiol. 2023 Mar 1;190:75-81. doi: 10.1016/j.amjcard.2022.11.044. Epub 2022 Dec 23. Am J Cardiol. 2023. PMID: 36566620 Free PMC article.
-
Validation of a Natural Language Processing Algorithm for the Extraction of the Sleep Parameters from the Polysomnography Reports.Healthcare (Basel). 2022 Sep 22;10(10):1837. doi: 10.3390/healthcare10101837. Healthcare (Basel). 2022. PMID: 36292283 Free PMC article.
-
Mortality in patients with heart failure and suicidal ideation discharged to skilled nursing facilities.J Geriatr Cardiol. 2022 Mar 28;19(3):198-208. doi: 10.11909/j.issn.1671-5411.2022.03.009. J Geriatr Cardiol. 2022. PMID: 35464651 Free PMC article. No abstract available.
-
Automatic extraction of 12 cardiovascular concepts from German discharge letters using pre-trained language models.Digit Health. 2021 Nov 26;7:20552076211057662. doi: 10.1177/20552076211057662. eCollection 2021 Jan-Dec. Digit Health. 2021. PMID: 34868618 Free PMC article.
-
Identification of Five Quality Needs for Rheumatology (Text Analysis and Literature Review).Front Med (Lausanne). 2021 Oct 25;8:757102. doi: 10.3389/fmed.2021.757102. eCollection 2021. Front Med (Lausanne). 2021. PMID: 34760902 Free PMC article. Review.
References
-
- Being a Meaningful User of Electronic Health Records. http://healthit.hhs.gov/portal/server.pt/community/healthit_hhs_gov__mea... (accessed 7 Jul 2011).
-
- CMS National Quality Forum Guide for Reading the EHR Incentive Program EP measures. 2010. http://www.cms.gov/QualityMeasures/Downloads/QMGuideForReadingEHR.pdf (accessed 6 Jan 2011).
-
- The Official Web site for the Medicare and Medicaid electronic health records (EHR) Incentive Programs. https://www.cms.gov/ehrincentiveprograms/ (accessed 6 Jan 2011).
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical
