Principles of Large-scale Evidence Generation and Evaluation across a Network of Databases (LEGEND)

Martijn J Schuemie; Patrick B Ryan; Nicole Pratt; RuiJun Chen; Seng Chan You; Harlan M Krumholz; David Madigan; George Hripcsak; Marc A Suchard

doi:10.1093/jamia/ocaa103

Principles of Large-scale Evidence Generation and Evaluation across a Network of Databases (LEGEND)

J Am Med Inform Assoc. 2020 Aug 1;27(8):1331-1337. doi: 10.1093/jamia/ocaa103.

Authors

Martijn J Schuemie^{1

2}, Patrick B Ryan^{1

3}, Nicole Pratt⁴, RuiJun Chen^{3

5}, Seng Chan You⁶, Harlan M Krumholz⁷, David Madigan⁸, George Hripcsak^{3

9}, Marc A Suchard^{2

10}

Affiliations

¹ Epidemiology Analytics, Janssen Research and Development, Titusville, New Jersey, USA.
² Department of Biostatistics, Fielding School of Public Health, University of California, Los Angeles, California, USA.
³ Department of Biomedical Informatics, Columbia University Medical Center, New York, New York, USA.
⁴ Quality Use of Medicines and Pharmacy Research Centre, University of South Australia, Adelaide, South Australia, Australia.
⁵ Department of Medicine, Weill Cornell Medical College, New York, New York, USA.
⁶ Department of Biomedical Informatics, Ajou University School of Medicine, Suwon, Korea.
⁷ Department of Medicine, Yale University School of Medicine, New Haven, California, USA.
⁸ Department of Statistics, Columbia University, New York, New York, USA.
⁹ Medical Informatics Services, New York-Presbyterian Hospital, New York, New York, USA.
¹⁰ Department of Biomathematics, David Geffen School of Medicine at UCLA, University of California, Los Angeles, California, USA.

Abstract

Evidence derived from existing health-care data, such as administrative claims and electronic health records, can fill evidence gaps in medicine. However, many claim such data cannot be used to estimate causal treatment effects because of the potential for observational study bias; for example, due to residual confounding. Other concerns include P hacking and publication bias. In response, the Observational Health Data Sciences and Informatics international collaborative launched the Large-scale Evidence Generation and Evaluation across a Network of Databases (LEGEND) research initiative. Its mission is to generate evidence on the effects of medical interventions using observational health-care databases while addressing the aforementioned concerns by following a recently proposed paradigm. We define 10 principles of LEGEND that enshrine this new paradigm, prescribing the generation and dissemination of evidence on many research questions at once; for example, comparing all treatments for a disease for many outcomes, thus preventing publication bias. These questions are answered using a prespecified and systematic approach, avoiding P hacking. Best-practice statistical methods address measured confounding, and control questions (research questions where the answer is known) quantify potential residual bias. Finally, the evidence is generated in a network of databases to assess consistency by sharing open-source analytics code to enhance transparency and reproducibility, but without sharing patient-level information. Here we detail the LEGEND principles and provide a generic overview of a LEGEND study. Our companion paper highlights an example study on the effects of hypertension treatments, and evaluates the internal and external validity of the evidence we generate.

Keywords: empirical calibration; observational studies; open science; treatment effects.

Publication types

Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't
Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

Antihypertensive Agents / adverse effects
Antihypertensive Agents / therapeutic use*
Computer Communication Networks*
Confidence Intervals
Data Interpretation, Statistical
Databases, Factual*
Humans
Hypertension / drug therapy*
Meta-Analysis as Topic*
Observation
Propensity Score
Randomized Controlled Trials as Topic
Treatment Outcome

Substances

Antihypertensive Agents

Abstract

Publication types

MeSH terms

Substances

Grants and funding