piRNA-IPdb: a PIWI-bound piRNAs database to mining NGS sncRNA data and beyond

BMC Genomics. 2021 Oct 26;22(1):765. doi: 10.1186/s12864-021-08071-6.

Abstract

Background: PIWI-interacting RNAs (piRNAs) are an abundant single-stranded type of small non-coding RNAs (sncRNAs), which initially were discovered in gonadal cells, with a role as defenders of genomic integrity in the germline, acting against the transposable elements. With a regular size range of 21-35 nt, piRNAs are associated with a PIWI-clade of Argonaute family proteins. The most widely accepted mechanisms of biogenesis for piRNAs involve the transcription of longer precursors of RNAs to be processed, by complexes of proteins, to functional size, preferentially accommodating uridine residues at the 5' end and 3' methylation to increase the stability of these molecules. piRNAs have also been detected in somatic cells, with diverse potential functions, indicating their high plasticity and pleiotropic activity. Discovered about two decades ago, piRNAs are a large and versatile type of sncRNA and that remain insufficiently identified and analyzed, through next-generation sequencing (NGS), to evaluate their processing, functions, and biogenesis in different cell types and during development. piRNAs' distinction from other sncRNAs has led to controversial results and interpretation difficulties when using existing databases because of the heterogeneity of the criteria used in making the distinction.

Description: We present "piRNA-IPdb", a database based uniquely on datasets obtaining after the defining characteristic of piRNAs: those small RNAs bound to PIWI proteins. We selected and analyzed sequences from piRBase that exclusively cover the binding to PIWI. We pooled a total of 18,821,815 sequences from RNA-seq after immunoprecipitation that included the binding to any of the mouse PIWI proteins (MILI, MIWI, or MIWI2).

Conclusions: In summary, we present the characteristics and potential use of piRNA-IPdb database for the analysis of bona fide piRNAs.

Keywords: Database; Immunoprecipitation; PIWI; microRNAs; piRNAs.

MeSH terms

  • Animals
  • Argonaute Proteins / genetics
  • Argonaute Proteins / metabolism
  • DNA Transposable Elements
  • High-Throughput Nucleotide Sequencing
  • Mice
  • RNA, Small Interfering / genetics
  • RNA, Small Untranslated* / genetics

Substances

  • Argonaute Proteins
  • DNA Transposable Elements
  • RNA, Small Interfering
  • RNA, Small Untranslated