Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2013 Nov 16:2013:103-10.
eCollection 2013.

On-time clinical phenotype prediction based on narrative reports

Affiliations

On-time clinical phenotype prediction based on narrative reports

Cosmin A Bejan et al. AMIA Annu Symp Proc. .

Abstract

In this paper we describe a natural language processing system which is able to predict whether or not a patient exhibits a specific phenotype using the information extracted from the narrative reports associated with the patient. Furthermore, the phenotypic annotations from our report dataset were performed at the report level which allows us to perform the prediction of the clinical phenotype at any point in time during the patient hospitalization period. Our experiments indicate that an important factor in achieving better results for this problem is to determine how much information to extract from the patient reports in the time interval between the patient admission time and the current prediction time.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
(a) The distribution of patients with respect to the ICU day when they were first identified as positive for pneumonia. (b) The distribution of reports with timestamps in a specific time interval.
Figure 2.
Figure 2.
Graphical representation of a data instance for the problem of on-time clinical phenotype prediction.
Figure 3.
Figure 3.
System performance results under different configurations. (a), (b) and (c) show the baseline results of the system when considering various combinations of word and UMLS concept n-grams. (d) shows the experiments performed when using the assertion feature as well as combinations of this feature with word and concept unigrams. Finally, (e), (f), (g), and (h) illustrate the steps of a greedy algorithm for searching the most optimal threshold values which indicate what word and concept n-gram features to be selected in the learning framework.

Similar articles

Cited by

References

    1. Nadkarni PM, Ohno-Machado L, Chapman WW. Natural language processing: an introduction. J Am Med Inform Assoc. 2011 Sep-Oct;18(5):544–51. - PMC - PubMed
    1. Demner-Fushman D, Chapman WW, McDonald CJ. What can natural language processing do for clinical decision support? J Biomed Inform. 2009 Oct;42(5):760–72. - PMC - PubMed
    1. Ritchie MD, Denny JC, Crawford DC, Ramirez AH, Weiner JB, Pulley JM, Basford MA, Brown-Gentry K, Balser JR, Masys DR, Haines JL, Roden DM. Robust replication of genotype-phenotype associations across multiple diseases in an electronic medical record. Am J Hum Genet. 2010;86(4):560–72. - PMC - PubMed
    1. Ritchie MD, Denny JC, Zuvich RL, Crawford DC, Schildcrout JS, Bastarache L, Ramirez AH, Mosely JD, Pulley JM, Basford MA, Bradford Y, Rasmussen LV, Pathak J, Chute CG, Kullo IJ, McCarty CA, Chisholm RL, Kho AN, Carlson CS, Larson EB, Jarvik GP, Sotoodehnia N, Manolio TA, Li R, Masys DR, Haines JL, Roden DM. Genome- and Phenome-Wide Analysis of Cardiac Conduction Identifies Markers of Arrhythmia Risk. Circulation. 2013. In print. - PMC - PubMed
    1. Liao KP, Kurreeman F, Li G, Duclos G, Murphy S, Guzman R, Cai T, Gupta N, Gainer V, Schur P, Cui J, Denny JC, Szolovits P, Churchill S, Kohane I, Karlson EW, Plenge RM. Associations of autoantibodies, autoimmune risk alleles, and clinical diagnoses from the electronic medical records in rheumatoid arthritis cases and non-rheumatoid arthritis controls. Arthritis Rheum. 2013 Mar;65(3):571–81. - PMC - PubMed

Publication types

LinkOut - more resources