HALO: hierarchical causal modeling for single cell multi-omics data

Nat Commun. 2025 Oct 7;16(1):8892. doi: 10.1038/s41467-025-63921-1.

Abstract

Though open chromatin may promote active transcription, gene expression responses may not be directly coordinated with changes in chromatin accessibility. Most existing methods for single-cell multi-omics data focus only on learning stationary, shared information among these modalities, overlooking modality-specific information delineating cellular states and dynamics resulting from causal relations among modalities. To address this, the epigenome-transcriptome relationship can be characterized in relation to time as coupled (changing dependently) or decoupled (changing independently). We propose the framework HALO, adopting a causal approach to model these temporal causal relations on two levels. On the representation level, HALO factorizes these two modalities into both coupled and decoupled latent representations, revealing their dynamic interplay. On the individual gene level, HALO matches gene-peak pairs and characterizes their changes over time. HALO discovers analogous biological functions between modalities, distinguishes epigenetic factors for lineage specification, and identifies temporal cis-regulation interactions relevant to cellular differentiation and human diseases.

MeSH terms

  • Cell Differentiation / genetics
  • Chromatin / genetics
  • Chromatin / metabolism
  • Computational Biology* / methods
  • Epigenesis, Genetic
  • Epigenome
  • Epigenomics / methods
  • Gene Expression Profiling
  • Humans
  • Multiomics
  • Single-Cell Analysis* / methods
  • Transcriptome

Substances

  • Chromatin