Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2014 Mar-Apr;21(2):299-307.
doi: 10.1136/amiajnl-2012-001506. Epub 2013 Jun 27.

A sense inventory for clinical abbreviations and acronyms created using clinical notes and medical dictionary resources

Affiliations

A sense inventory for clinical abbreviations and acronyms created using clinical notes and medical dictionary resources

Sungrim Moon et al. J Am Med Inform Assoc. 2014 Mar-Apr.

Abstract

Objective: To create a sense inventory of abbreviations and acronyms from clinical texts.

Methods: The most frequently occurring abbreviations and acronyms from 352,267 dictated clinical notes were used to create a clinical sense inventory. Senses of each abbreviation and acronym were manually annotated from 500 random instances and lexically matched with long forms within the Unified Medical Language System (UMLS V.2011AB), Another Database of Abbreviations in Medline (ADAM), and Stedman's Dictionary, Medical Abbreviations, Acronyms & Symbols, 4th edition (Stedman's). Redundant long forms were merged after they were lexically normalized using Lexical Variant Generation (LVG).

Results: The clinical sense inventory was found to have skewed sense distributions, practice-specific senses, and incorrect uses. Of 440 abbreviations and acronyms analyzed in this study, 949 long forms were identified in clinical notes. This set was mapped to 17,359, 5233, and 4879 long forms in UMLS, ADAM, and Stedman's, respectively. After merging long forms, only 2.3% matched across all medical resources. The UMLS, ADAM, and Stedman's covered 5.7%, 8.4%, and 11% of the merged clinical long forms, respectively. The sense inventory of clinical abbreviations and acronyms and anonymized datasets generated from this study are available for public use at http://www.bmhi.umn.edu/ihi/research/nlpie/resources/index.htm ('Sense Inventories', website).

Conclusions: Clinical sense inventories of abbreviations and acronyms created using clinical notes and medical dictionary resources demonstrate challenges with term coverage and resource integration. Further work is needed to help with standardizing abbreviations and acronyms in clinical care and biomedicine to facilitate automated processes such as text-mining and information extraction.

Keywords: Abbreviations as Topic*; Clinical sense inventory; Medical Records*; Natural Language Processing*; Word sense disambiguation.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Overview of collected long forms from the Unified Medical Language System (UMLS). CUI, Concept Unique Identifier.
Figure 2
Figure 2
Merging process of long forms. Extracted Unified Medical Language System (UMLS) file=result from figure 1. ADAM, Another Database of Abbreviations in Medline; Stedman's, Stedman's Medical Abbreviations, Acronyms & Symbols; CUI, Concept Unique Identifier.
Figure 3
Figure 3
The coverage among resources. Master file and Refined Master file=result from figure 2. UMLS, Unified Medical Language System; ADAM, Another Database of Abbreviations in Medline; Stedman's, Stedman's Medical Abbreviations, Acronyms & Symbols.

Similar articles

Cited by

References

    1. Pakhomov S, Pedersen T, Chute CG. Abbreviation and acronym disambiguation in clinical discourse. AMIA Annu Symp Proc 2005:589–93 - PMC - PubMed
    1. Pakhomov S. Semi-supervised maximum entropy based approach to acronym and abbreviation normalization in medical texts. Proceedings of the 40th Annual Meeting on Association for Computational Linguistics Philadelphia, Pennsylvania: Association for Computational Linguistics; 2002:160–7
    1. Stetson PD, Johnson SB, Scotch M, et al. The sublanguage of cross-coverage. Proc AMIA Symp 2002:742–6 - PMC - PubMed
    1. Xu H, Markatou M, Dimova R, et al. Machine learning and word sense disambiguation in the biomedical domain: design and evaluation issues. BMC Bioinformatics 2006;7:334. - PMC - PubMed
    1. Xu H, Stetson PD, Friedman C. A study of abbreviations in clinical notes. AMIA Annu Symp Proc 2007:821–5 - PMC - PubMed

Publication types