Size-variable zone in V3 region of 16S rRNA

RNA Biol. 2017 Nov 2;14(11):1514-1521. doi: 10.1080/15476286.2017.1317912. Epub 2017 May 23.

Abstract

The size distribution of complete 16S-rRNA sequences from the SILVA-database and nucleotide shifts that might interfere with the secondary structure of the molecules were evaluated. Overall, 513,309 sequences recorded in SILVA were used to estimate the size of hypervariable regions of the gene. Redundant sequences were treated as a single sequence to achieve a better representation of the molecular diversity. Nucleotides found in each position in 95% of the sequences were considered the consensus sequences for different size-groups (consensus95). The sizes of different regions ranged from 96.7 to 283.1 nucleotides and had similar distribution patterns, except for the V3 region, which exhibited a bimodal distribution composed of 2 main peaks of 161 and 186 nt. The alignment of Consensuses95 of fractions 161 and 186 showed a high degree of similarity and conservation, except for the central positions (gap zone), where the sequence was highly variable and several deletions were observed. Structurally, the gap zone forms the central part of helix 17 (H17), and its extension was directly reflected in the size of this helix. H17 is part of a multihelix conjunction known as the 5-way junction (5 WJ), which is indispensable for 30 S ribosome assembly. However, because a drastic variation in the sequence size of V3 region occurs at a central position in loop H17 without affecting the base of the loop, it has no apparent effect on 5 WJ. Finally, considering that these differences were detected in non-redundant sequences, it can be concluded that this is not an uncommon or isolated event and that the V3 region is possibly more likely to mutate than are other regions.

Keywords: 16S gene; 5-way junction; V3 pattern; V3 size different; bacterial rRNA; helix 17; k-mers.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Bacteria / classification
  • Bacteria / genetics*
  • Base Pairing
  • Base Sequence
  • Consensus Sequence
  • Databases, Genetic
  • Genes, Bacterial*
  • Genetic Variation
  • Metagenome*
  • Nucleic Acid Conformation
  • Phylogeny
  • RNA, Ribosomal, 16S / genetics*
  • Sequence Analysis, RNA / statistics & numerical data*

Substances

  • RNA, Ribosomal, 16S