EvidenceMap: a three-level knowledge representation for medical evidence computation and comprehension

Tian Kang; Yingcheng Sun; Jae Hyun Kim; Casey Ta; Adler Perotte; Kayla Schiffer; Mutong Wu; Yang Zhao; Nour Moustafa-Fahmy; Yifan Peng; Chunhua Weng

doi:10.1093/jamia/ocad036

EvidenceMap: a three-level knowledge representation for medical evidence computation and comprehension

J Am Med Inform Assoc. 2023 May 19;30(6):1022-1031. doi: 10.1093/jamia/ocad036.

Authors

Affiliations

¹ Department of Biomedical Informatics, Columbia University, New York, New York, USA.
² Department of Statistics, Columbia University, New York, New York, USA.
³ Department of Population Health Sciences, Weill Cornell Medicine, New York, New York, USA.

Abstract

Objective: To develop a computable representation for medical evidence and to contribute a gold standard dataset of annotated randomized controlled trial (RCT) abstracts, along with a natural language processing (NLP) pipeline for transforming free-text RCT evidence in PubMed into the structured representation.

Materials and methods: Our representation, EvidenceMap, consists of 3 levels of abstraction: Medical Evidence Entity, Proposition and Map, to represent the hierarchical structure of medical evidence composition. Randomly selected RCT abstracts were annotated following EvidenceMap based on the consensus of 2 independent annotators to train an NLP pipeline. Via a user study, we measured how the EvidenceMap improved evidence comprehension and analyzed its representative capacity by comparing the evidence annotation with EvidenceMap representation and without following any specific guidelines.

Results: Two corpora including 229 disease-agnostic and 80 COVID-19 RCT abstracts were annotated, yielding 12 725 entities and 1602 propositions. EvidenceMap saves users 51.9% of the time compared to reading raw-text abstracts. Most evidence elements identified during the freeform annotation were successfully represented by EvidenceMap, and users gave the enrollment, study design, and study Results sections mean 5-scale Likert ratings of 4.85, 4.70, and 4.20, respectively. The end-to-end evaluations of the pipeline show that the evidence proposition formulation achieves F1 scores of 0.84 and 0.86 in the adjusted random index score.

Conclusions: EvidenceMap extends the participant, intervention, comparator, and outcome framework into 3 levels of abstraction for transforming free-text evidence from the clinical literature into a computable structure. It can be used as an interoperable format for better evidence retrieval and synthesis and an interpretable representation to efficiently comprehend RCT findings.

Keywords: corpus annotation; evidence-based medicine; knowledge representation; medical literature analysis and retrieval system; natural language processing; randomized controlled trial.

Publication types

Randomized Controlled Trial
Research Support, N.I.H., Extramural

MeSH terms

COVID-19*
Comprehension*
Humans
Natural Language Processing
PubMed

Abstract

Publication types

MeSH terms

Grants and funding