Genome-Wide Analysis of Alternative Splicing and Non-Coding RNAs Reveal Complicated Transcriptional Regulation in Cannabis sativa L

Int J Mol Sci. 2021 Nov 5;22(21):11989. doi: 10.3390/ijms222111989.

Abstract

Mining the structural genes of the fatty acid (FA) and cellulose biosynthetic pathways, and exploring how alternative splicing (AS), microRNAs (miRNAs) and long non-coding RNAs (lncRNAs) regulate the biosynthesis of cannabinoids, FA and cellulose, would improve our understanding of gene expression and regulation at the post-transcriptional level in Cannabis sativa L. In this study, transcriptome, small RNA and degradome libraries of hemp 'Yunma No.1' were constructed and comprehensively analyzed. In total, 154, 32 and 331 transcripts encoding key enzymes involved in the biosynthesis of cannabinoids, FA and cellulose, respectively, were predicted, and AS occurred in 368 of these transcripts. Moreover, 183 conserved miRNAs, 380 C. sativa-specific miRNAs and 7783 lncRNAs were predicted. Among them, 70 miRNAs and 17 lncRNAs potentially targeted 13 and 17 transcripts, respectively, encoding key enzymes or transporters involved in the biosynthesis of cannabinoids, cellulose or FA. Finally, crosstalk between AS and miRNAs or lncRNAs involved in cannabinoid and cellulose biosynthesis was also predicted. Together, these results provide insights into the complicated network of gene expression and regulation in C. sativa.

Keywords: Cannabis sativa L.; alternative splicing; gene expression and regulation; non-coding RNAs.

MeSH terms

  • Alternative Splicing
  • Biosynthetic Pathways
  • Cannabinoids / metabolism
  • Cannabis / genetics*
  • Cannabis / metabolism
  • Cellulose / metabolism
  • Fatty Acids / metabolism
  • Gene Expression Profiling / methods
  • Gene Expression Regulation, Plant
  • Gene Regulatory Networks
  • Genome, Plant
  • MicroRNAs / genetics
  • Plant Proteins / genetics*
  • Plant Proteins / metabolism
  • RNA, Long Noncoding / genetics*
  • Whole Genome Sequencing

Substances

  • Cannabinoids
  • Fatty Acids
  • MicroRNAs
  • Plant Proteins
  • RNA, Long Noncoding
  • Cellulose