Automated misspelling detection and correction in clinical free-text records
- PMID: 25917057
- DOI: 10.1016/j.jbi.2015.04.008
Automated misspelling detection and correction in clinical free-text records
Abstract
Accurate electronic health records are important for clinical care and research as well as ensuring patient safety. It is crucial for misspelled words to be corrected in order to ensure that medical records are interpreted correctly. This paper describes the development of a spelling correction system for medical text. Our spell checker is based on Shannon's noisy channel model, and uses an extensive dictionary compiled from many sources. We also use named entity recognition, so that names are not wrongly corrected as misspellings. We apply our spell checker to three different types of free-text data: clinical notes, allergy entries, and medication orders; and evaluate its performance on both misspelling detection and correction. Our spell checker achieves detection performance of up to 94.4% and correction accuracy of up to 88.2%. We show that high-performance spelling correction is possible on a variety of clinical documents.
Keywords: Electronic health record; Named entity recognition; Natural language processing; Spelling correction.
Copyright © 2015 Elsevier Inc. All rights reserved.
Similar articles
-
Improving Terminology Mapping in Clinical Text with Context-Sensitive Spelling Correction.Stud Health Technol Inform. 2017;235:241-245. Stud Health Technol Inform. 2017. PMID: 28423790
-
Automated Spelling Correction for Clinical Text Mining in Russian.Stud Health Technol Inform. 2020 Jun 16;270:43-47. doi: 10.3233/SHTI200119. Stud Health Technol Inform. 2020. PMID: 32570343
-
Automated Misspelling Detection and Correction in Persian Clinical Text.J Digit Imaging. 2020 Jun;33(3):555-562. doi: 10.1007/s10278-019-00296-y. J Digit Imaging. 2020. PMID: 31823185 Free PMC article.
-
Hierarchical semantic structures for medical NLP.Stud Health Technol Inform. 2013;192:1194. Stud Health Technol Inform. 2013. PMID: 23920968 Review.
-
Practical Implementation and Challenges of Artificial Intelligence-Driven Electronic Health Record Evaluation: Protected Health Information.Adv Chronic Kidney Dis. 2022 Sep;29(5):427-430. doi: 10.1053/j.ackd.2022.05.003. Adv Chronic Kidney Dis. 2022. PMID: 36253025 Review.
Cited by
-
Improving the quality of Persian clinical text with a novel spelling correction system.BMC Med Inform Decis Mak. 2024 Aug 5;24(1):220. doi: 10.1186/s12911-024-02613-0. BMC Med Inform Decis Mak. 2024. PMID: 39103825 Free PMC article.
-
A comparison of two user-friendly methods to identify and support correction of misspelled medications.Prev Med Rep. 2024 May 17;43:102765. doi: 10.1016/j.pmedr.2024.102765. eCollection 2024 Jul. Prev Med Rep. 2024. PMID: 38798907 Free PMC article.
-
Context-Sensitive Spelling Correction of Clinical Text via Conditional Independence.Proc Mach Learn Res. 2022 Apr;174:234-247. Proc Mach Learn Res. 2022. PMID: 38665367 Free PMC article.
-
A scoping review of preprocessing methods for unstructured text data to assess data quality.Int J Popul Data Sci. 2022 Oct 4;7(1):1757. doi: 10.23889/ijpds.v6i1.1757. eCollection 2022. Int J Popul Data Sci. 2022. PMID: 37670734 Free PMC article. Review.
-
Electronic health record data quality assessment and tools: a systematic review.J Am Med Inform Assoc. 2023 Sep 25;30(10):1730-1740. doi: 10.1093/jamia/ocad120. J Am Med Inform Assoc. 2023. PMID: 37390812 Free PMC article.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
