Cap analysis of gene expression (CAGE) and noncoding regulatory elements

Semin Immunopathol. 2022 Jan;44(1):127-136. doi: 10.1007/s00281-021-00886-5. Epub 2021 Sep 1.

Abstract

Cap analysis of gene expression (CAGE) was developed to detect the 5' end of RNA. Trapping of the RNA 5'-cap structure enables the enrichment and selective sequencing of complete transcripts. Upscaled high-throughput versions of CAGE have enabled the genome-wide identification of transcription start sites, including those of transcriptionally active promoters and enhancers. CAGE sequencing can be exploited to draw comprehensive maps of active genomic regulatory elements in a cell type- and activation-specific manner. Immune cells are among the best candidates for such analysis in humans because they are easily accessible. In this review, we discuss how CAGE data are instrumental in integrative analyses with quantitative trait loci and omics data, and how they aid the mechanistic interpretation of the effects of genetic variation across the entire human genome. Integrating CAGE data with the currently available omics information will contribute to a better understanding of the genome-wide association study variants that lie outside annotated genes, deepening our knowledge of human diseases and enabling the targeted design of more specific therapeutic interventions.

Keywords: CAGE sequencing; Enhancers and promoters; GWAS association; Genetic variation; QTL integration; RNA CAP-trapping.

Publication types

  • Review

MeSH terms

  • Gene Expression
  • Genome-Wide Association Study*
  • Humans
  • Promoter Regions, Genetic
  • Regulatory Sequences, Nucleic Acid* / genetics
  • Transcription Initiation Site