emiRIT: a text-mining-based resource for microRNA information

Database (Oxford). 2021 May 28:2021:baab031. doi: 10.1093/database/baab031.

Abstract

microRNAs (miRNAs) are essential gene regulators, and their dysregulation often leads to diseases. Easy access to miRNA information is crucial for interpreting generated experimental data, connecting facts across publications and developing new hypotheses built on previous knowledge. Here, we present extracting miRNA Information from Text (emiRIT), a text-miningbased resource, which presents miRNA information mined from the literature through a user-friendly interface. We collected 149 ,233 miRNA -PubMed ID pairs from Medline between January 1997 and May 2020. emiRIT currently contains 'miRNA -gene regulation' (69 ,152 relations), 'miRNA disease (cancer)' (12 ,300 relations), 'miRNA -biological process and pathways' (23, 390 relations) and circulatory 'miRNAs in extracellular locations' (3782 relations). Biological entities and their relation to miRNAs were extracted from Medline abstracts using publicly available and in-house developed text-mining tools, and the entities were normalized to facilitate querying and integration. We built a database and an interface to store and access the integrated data, respectively. We provide an up-to-date and user-friendly resource to facilitate access to comprehensive miRNA information from the literature on a large scale, enabling users to navigate through different roles of miRNA and examine them in a context specific to their information needs. To assess our resource's information coverage, we have conducted two case studies focusing on the target and differential expression information of miRNAs in the context of cancer and a third case study to assess the usage of emiRIT in the curation of miRNA information. Database URL: https://research.bioinformatics.udel.edu/emirit/.

MeSH terms

  • Data Mining*
  • Databases, Factual
  • MEDLINE
  • MicroRNAs* / genetics
  • PubMed

Substances

  • MicroRNAs