ARTDeco: automatic readthrough transcription detection

BMC Bioinformatics. 2020 May 26;21(1):214. doi: 10.1186/s12859-020-03551-0.

Abstract

Background: Mounting evidence suggests several diseases and biological processes target transcription termination to misregulate gene expression. Disruption of transcription termination leads to readthrough transcription past the 3' end of genes, which can result in novel transcripts, changes in epigenetic states and altered 3D genome structure.

Results: We developed Automatic Readthrough Transcription Detection (ARTDeco), a tool to detect and analyze multiple features of readthrough transcription from RNA-seq and other next-generation sequencing (NGS) assays that profile transcriptional activity. ARTDeco robustly quantifies the global severity of readthrough phenotypes and reliably identifies individual genes that fail to terminate (readthrough genes), genes that are aberrantly transcribed due to upstream termination failure (read-in genes), and novel transcripts created as a result of readthrough (downstream-of-gene, or DoG, transcripts). We used ARTDeco to characterize readthrough transcription observed during influenza A virus (IAV) infection, validating its specificity and sensitivity by comparing its performance in samples infected with a mutant virus that fails to block transcription termination. We verify ARTDeco's ability to detect readthrough and to identify read-in genes from different experimental assays across multiple experimental systems with known defects in transcriptional termination, and we show how these results can be leveraged to improve the interpretation of gene expression and downstream analysis. Applying ARTDeco to a gene expression data set from IAV-infected monocytes from different donors, we find strong evidence that read-in gene-associated expression quantitative trait loci (eQTLs) likely regulate genes upstream of read-in genes. This indicates that taking readthrough transcription into account is important for the interpretation of eQTLs in systems where transcription termination is blocked.
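
To make the readthrough and read-in concepts concrete, the following is a minimal illustrative sketch, not ARTDeco's actual implementation or interface. It assumes that read counts have already been tallied for each gene body and for fixed windows upstream and downstream of the gene; the class name, function names, window layout and the 0.2 threshold are all hypothetical choices made for illustration only.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class GeneCounts:
    """Hypothetical per-gene input: read counts and lengths (bp) for the
    gene body and for windows upstream and downstream of the gene."""
    gene_reads: int
    gene_length: int
    downstream_reads: int
    downstream_length: int
    upstream_reads: int
    upstream_length: int


def density(reads: int, length: int) -> float:
    """Reads per kilobase; an empty window yields 0.0."""
    return 0.0 if length <= 0 else reads / (length / 1000.0)


def readthrough_level(g: GeneCounts) -> float:
    """Downstream density relative to gene-body density (assumed metric)."""
    body = density(g.gene_reads, g.gene_length)
    if body == 0.0:
        return 0.0
    return density(g.downstream_reads, g.downstream_length) / body


def read_in_level(g: GeneCounts) -> float:
    """Upstream density relative to gene-body density (assumed metric)."""
    body = density(g.gene_reads, g.gene_length)
    if body == 0.0:
        return 0.0
    return density(g.upstream_reads, g.upstream_length) / body


def classify(g: GeneCounts, threshold: float = 0.2) -> List[str]:
    """Label a gene using an arbitrary illustrative threshold."""
    labels = []
    if readthrough_level(g) >= threshold:
        labels.append("readthrough")
    if read_in_level(g) >= threshold:
        labels.append("read-in")
    return labels


example = GeneCounts(
    gene_reads=1000, gene_length=2000,
    downstream_reads=500, downstream_length=5000,
    upstream_reads=0, upstream_length=5000,
)
print(readthrough_level(example), read_in_level(example), classify(example))
```

In this sketch a gene whose downstream window carries a substantial fraction of its gene-body read density is flagged as a candidate readthrough gene, while a gene with comparable density immediately upstream is flagged as a candidate read-in gene. A real analysis would also need to account for strand, neighboring genes and expression thresholds, none of which are modeled here.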

Conclusions: ARTDeco aids researchers investigating readthrough transcription in a variety of systems and contexts.

Keywords: Gene expression; Next-generation sequencing analysis; Readthrough transcription; Transcription termination; Transcriptomics.

MeSH terms

  • Gene Expression Regulation
  • High-Throughput Nucleotide Sequencing
  • Humans
  • Influenza A virus / physiology
  • Monocytes / metabolism
  • Monocytes / virology
  • Quantitative Trait Loci
  • RNA-Seq
  • Software*
  • Transcription Termination, Genetic
  • Transcription, Genetic*