Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011 Dec;18 Suppl 1(Suppl 1):i116-24.
doi: 10.1136/amiajnl-2011-000321. Epub 2011 Jul 31.

EliXR: an approach to eligibility criteria extraction and representation

Affiliations

EliXR: an approach to eligibility criteria extraction and representation

Chunhua Weng et al. J Am Med Inform Assoc. 2011 Dec.

Abstract

Objective: To develop a semantic representation for clinical research eligibility criteria to automate semistructured information extraction from eligibility criteria text.

Materials and methods: An analysis pipeline called eligibility criteria extraction and representation (EliXR) was developed that integrates syntactic parsing and tree pattern mining to discover common semantic patterns in 1000 eligibility criteria randomly selected from http://ClinicalTrials.gov. The semantic patterns were aggregated and enriched with unified medical language systems semantic knowledge to form a semantic representation for clinical research eligibility criteria.

Results: The authors arrived at 175 semantic patterns, which form 12 semantic role labels connected by their frequent semantic relations in a semantic network.

Evaluation: Three raters independently annotated all the sentence segments (N=396) for 79 test eligibility criteria using the 12 top-level semantic role labels. Eight-six per cent (339) of the sentence segments were unanimously labelled correctly and 13.8% (55) were correctly labelled by two raters. The Fleiss' κ was 0.88, indicating a nearly perfect interrater agreement.

Conclusion: This study present a semi-automated data-driven approach to developing a semantic network that aligns well with the top-level information structure in clinical research eligibility criteria text and demonstrates the feasibility of using the resulting semantic role labels to generate semistructured eligibility criteria with nearly perfect interrater reliability.

PubMed Disclaimer

Figures

Figure 1
Figure 1
The hierarchical syntax of an example criterion. Semantic role labels are in bold text. The corresponding sentence constituents are in italic text.
Figure 2
Figure 2
The EliXR framework and its key steps. UMLS, unified medical language systems.
Figure 3
Figure 3
An example semantically labelled parse tree for myocardial infarction within 90 days of study start, unstable angina within 14 days of study start, or any clinical evidence of active myocardial ischemia.
Figure 4
Figure 4
The EliXR semantic network for eligibility criteria.

Similar articles

Cited by

References

    1. Thadani SR, Weng C, Bigger JT, et al. Electronic screening improves efficiency in clinical trial recruitment. J Am Med Inform Assoc 2009;16:869–73 - PMC - PubMed
    1. Weng C, Tu SW, Sim I, et al. Formal representation of eligibility criteria: a literature review. J Biomed Inform 2010;43:451–67 - PMC - PubMed
    1. Tu SW, Campbell JR, Glasgow J, et al. The SAGE Guideline Model: achievements and overview. J Am Med Inform Assoc 2007;14:589–98 - PMC - PubMed
    1. Tu SW, Peleg M, Carini S, et al. A practical method for transforming free-text eligibility criteria into computable criteria. J Biomed Inform 2011;44:239–50 - PMC - PubMed
    1. Definition of a semantic role by Conference on Computational Natural Language Learning (CCNLL). http://www.lsi.upc.edu/∼srlconll/ (accessed 25 May 2011).

Publication types