A common longitudinal intensive care unit data format (CLIF) for critical illness research

Intensive Care Med. 2025 Mar;51(3):556-569. doi: 10.1007/s00134-025-07848-7. Epub 2025 Mar 13.

Abstract

Rationale: Critical illness threatens millions of lives annually. Electronic health record (EHR) data are a source of granular information that could generate crucial insights into the nature and optimal treatment of critical illness.

Objectives: Overcome the data management, security, and standardization barriers to large-scale critical illness EHR studies.

Methods: We developed a Common Longitudinal Intensive Care Unit (ICU) data Format (CLIF), an open-source database format to harmonize EHR data necessary to study critical illness. We conducted proof-of-concept studies with a federated research architecture: (1) an external validation of an in-hospital mortality prediction model for critically ill patients and (2) an assessment of 72-h temperature trajectories and their association with mechanical ventilation and in-hospital mortality using group-based trajectory models.

Measurements and main results: We converted longitudinal data from 111,440 critically ill patient admissions from 2020 to 2021 (mean age 60.7 years [standard deviation 17.1], 28% Black, 7% Hispanic, 44% female) across 9 health systems and 39 hospitals into CLIF databases. The in-hospital mortality prediction model had varying performance across CLIF consortium sites (AUCs: 0.73-0.81, Brier scores: 0.06-0.10) with degradation in performance relative to the derivation site. Temperature trajectories were similar across health systems. Hypothermic and hyperthermic-slow-resolver patients consistently had the highest mortality.

Conclusions: CLIF enables transparent, efficient, and reproducible critical care research across diverse health systems. Our federated case studies showcase CLIF's potential for disease sub-phenotyping and clinical decision-support evaluation. Future applications include pragmatic EHR-based trials, target trial emulations, foundational artificial intelligence (AI) models of critical illness, and real-time critical care quality dashboards.

Keywords: Critical care data; Machine learning; Temperature trajectory modeling.

MeSH terms

  • Adult
  • Aged
  • Critical Illness* / mortality
  • Critical Illness* / therapy
  • Databases, Factual
  • Electronic Health Records* / statistics & numerical data
  • Female
  • Hospital Mortality
  • Humans
  • Intensive Care Units* / organization & administration
  • Intensive Care Units* / statistics & numerical data
  • Longitudinal Studies
  • Male
  • Middle Aged