Chromatin interaction-aware gene regulatory modeling with graph attention networks

Genome Res. 2022 May;32(5):930-944. doi: 10.1101/gr.275870.121. Epub 2022 Apr 8.

Abstract

Linking distal enhancers to genes and modeling their impact on target gene expression are longstanding unresolved problems in regulatory genomics and critical for interpreting noncoding genetic variation. Here, we present a new deep learning approach called GraphReg that exploits 3D interactions from chromosome conformation capture assays to predict gene expression from 1D epigenomic data or genomic DNA sequence. By using graph attention networks to exploit the connectivity of distal elements up to 2 Mb away in the genome, GraphReg more faithfully models gene regulation and more accurately predicts gene expression levels than the state-of-the-art deep learning methods for this task. Feature attribution used with GraphReg accurately identifies functional enhancers of genes, as validated by CRISPRi-FlowFISH and TAP-seq assays, outperforming both convolutional neural networks (CNNs) and the recently proposed activity-by-contact model. Sequence-based GraphReg also accurately predicts direct transcription factor (TF) targets as validated by CRISPRi TF knockout experiments via in silico ablation of TF binding motifs. GraphReg therefore represents an important advance in modeling the regulatory impact of epigenomic and sequence elements.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Binding Sites / genetics
  • Chromatin / genetics
  • Gene Expression Regulation*
  • Gene Regulatory Networks
  • Neural Networks, Computer*
  • Regulatory Sequences, Nucleic Acid
  • Transcription Factors / metabolism

Substances

  • Chromatin
  • Transcription Factors