Nanopore direct RNA sequencing detects DUX4-activated repeats and isoforms in human muscle cells

Hum Mol Genet. 2021 May 12;30(7):552-563. doi: 10.1093/hmg/ddab063.

Abstract

Facioscapulohumeral muscular dystrophy (FSHD) is an inherited muscle disease caused by misexpression of the DUX4 gene in skeletal muscle. DUX4 is a transcription factor that is normally expressed in the cleavage-stage embryo and regulates the expression of genes involved in early embryonic development. Recent studies revealed that DUX4 also activates the transcription of repetitive elements such as endogenous retroviruses (ERVs), mammalian apparent long terminal repeat (LTR)-retrotransposons and pericentromeric satellite repeats (Human Satellite II). DUX4-bound ERV sequences also create alternative promoters for genes or long non-coding RNAs, producing fusion transcripts. To further understand transcriptional regulation by DUX4, we performed nanopore long-read direct RNA sequencing (dRNA-seq) of human muscle cells induced by DUX4, because long reads show whole isoforms with greater confidence. We successfully detected differential expression of known DUX4-induced genes and discovered 61 differentially expressed repeat loci, which are near DUX4-ChIP peaks. We also identified 247 gene-ERV fusion transcripts, of which 216 were not reported previously. In addition, long-read dRNA-seq clearly showed that RNA splicing is a common event in DUX4-activated ERV transcripts. Long-read analysis also showed that non-LTR transposons, including Alu elements, are transcribed from LTRs. Our findings reveal further complexity of DUX4-induced ERV transcripts. This catalogue of DUX4-activated repetitive elements may provide useful information to elucidate the pathology of FSHD. Our results also indicate that nanopore dRNA-seq has strengths complementary to those of conventional short-read complementary DNA sequencing.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Cell Line, Tumor
  • Gene Expression Profiling
  • Gene Expression Regulation
  • Homeodomain Proteins / genetics*
  • Humans
  • Muscle Cells / metabolism
  • Muscle, Skeletal / metabolism*
  • Muscular Dystrophy, Facioscapulohumeral / genetics*
  • Muscular Dystrophy, Facioscapulohumeral / pathology
  • Nanopores*
  • Protein Isoforms / genetics
  • RNA Isoforms / genetics
  • Repetitive Sequences, Nucleic Acid / genetics*
  • Reverse Transcriptase Polymerase Chain Reaction
  • Sequence Analysis, RNA / methods*
  • Sequence Analysis, RNA / statistics & numerical data

Substances

  • DUX4L1 protein, human
  • Homeodomain Proteins
  • Protein Isoforms
  • RNA Isoforms