miRCat2: accurate prediction of plant and animal microRNAs from next-generation sequencing datasets

Bioinformatics. 2017 Aug 15;33(16):2446-2454. doi: 10.1093/bioinformatics/btx210.

Abstract

Motivation: MicroRNAs are a class of ∼21-22 nt small RNAs which are excised from a stable hairpin-like secondary structure. They have important gene regulatory functions and are involved in many pathways including developmental timing, organogenesis and development in eukaryotes. There are several computational tools for miRNA detection from next-generation sequencing datasets. However, many of these tools suffer from high false positive and false negative rates. Here we present a novel miRNA prediction algorithm, miRCat2. miRCat2 incorporates a new entropy-based approach to detect miRNA loci, which is designed to cope with the high sequencing depth of current next-generation sequencing datasets. It has a user-friendly interface and produces graphical representations of the hairpin structure and plots depicting the alignment of sequences on the secondary structure.

Results: We test miRCat2 on a number of animal and plant datasets and present a comparative analysis with miRCat, miRDeep2, miRPlant and miReap. We also use mutants in the miRNA biogenesis pathway to evaluate the predictions of these tools. Results indicate that miRCat2 has an improved accuracy compared with other methods tested. Moreover, miRCat2 predicts several new miRNAs that are differentially expressed in wild-type versus mutants in the miRNA biogenesis pathway.

Availability and implementation: miRCat2 is part of the UEA small RNA Workbench and is freely available from http://srna-workbench.cmp.uea.ac.uk/.

Contact: v.moulton@uea.ac.uk or s.moxon@uea.ac.uk.

Supplementary information: Supplementary data are available at Bioinformatics online.

MeSH terms

  • Algorithms
  • Animals
  • Computational Biology / methods*
  • Entropy
  • Genetic Loci*
  • High-Throughput Nucleotide Sequencing / methods*
  • MicroRNAs / genetics*
  • Plants / genetics
  • Plants / metabolism
  • Sequence Analysis, DNA / methods
  • Sequence Analysis, RNA / methods
  • Software*

Substances

  • MicroRNAs