The rare lncRNA GOLLD is widespread and structurally conserved among Mycobacterium tRNA arrays

RNA Biol. 2020 Jul;17(7):1001-1008. doi: 10.1080/15476286.2020.1748922. Epub 2020 Apr 22.

Abstract

Noncoding RNA (ncRNA) genes produce transcripts involved in a wide range of functions, including catalytic and regulatory functions. Besides, some transcripts have highly complex structures that may impact their activities. Among the largest bacterial ncRNAs, there is the rare GOLLD RNA, which is associated with tRNA genes and supposed to be chromosome- and phage-encoded in specialized groups of bacteria, including those from Lactobacillales and Actinomycetales orders. The only GOLLD structure was inferred from a variety of sequences, including many marine metagenomes. To explore GOLLD RNA in bacterial genomes, we mined the GOLLD gene in thousands of Mycobacterium and virus genomes using Infernal software. We identified this gene in 350 mycobacteria, including megaplasmids, and 39 bacteriophages, mainly in the genomic context of tRNA arrays. Mycobacterium GOLLD genes presented a high diversity and were distributed in three phylogenetic groups: (i) Mycobacterium exclusive; (ii) Mycobacterium and mycobacteriophages; and (iii) mycobacteriophage exclusive. We also determined the GOLLD secondary structure of each group using R2 R software based on GOLLD alignments generated by Infernal software. All GOLLD groups displayed a 3' half conserved structure, including utter E-loops pseudoknots substructures, also shared by non-Mycobacterium GOLLD while the 5' half motif was different among the groups. Here, we showed that the lncRNA GOLLD is widespread in Mycobacterium within tRNA arrays and corroborated the previously predicted GOLLD secondary structure.

Keywords: Mycobacterium; GOLLD secondary structure; HNH endonuclease; lncRNA GOLLD; tRNA array.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Genome, Viral
  • Genomics / methods
  • Mycobacteriophages / classification
  • Mycobacteriophages / genetics
  • Mycobacterium / classification
  • Mycobacterium / genetics*
  • Phylogeny
  • RNA, Long Noncoding*
  • RNA, Transfer / chemistry*
  • RNA, Transfer / genetics*
  • Reverse Transcriptase Polymerase Chain Reaction

Substances

  • RNA, Long Noncoding
  • RNA, Transfer

Grants and funding

This work was supported by Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq), Coordenação de Aperfeiçoamento de Pessoal de Nível Superior -Brasil (CAPES) - Finance Code 001, and Oswaldo Cruz Institute grants.