R-loopBase: a knowledgebase for genome-wide R-loop formation and regulation

Nucleic Acids Res. 2022 Jan 7;50(D1):D303-D315. doi: 10.1093/nar/gkab1103.

Abstract

R-loops play versatile roles in many physiological and pathological processes, and are of great interest to scientists in multiple fields. However, controversy about their genomic localization and incomplete understanding of their regulatory network raise great challenges for R-loop research. Here, we present R-loopBase (https://rloopbase.nju.edu.cn) to tackle these pressing issues by systematic integration of genomics and literature data. First, based on 107 high-quality genome-wide R-loop mapping datasets generated by 11 different technologies, we present a reference set of human R-loop zones for high-confidence R-loop localization, and spot conservative genomic features associated with R-loop formation. Second, through literature mining and multi-omics analyses, we curate the most comprehensive list of R-loop regulatory proteins and their targeted R-loops in multiple species to date. These efforts help reveal a global regulatory network of R-loop dynamics and its potential links to the development of cancers and neurological diseases. Finally, we integrate billions of functional genomic annotations, and develop interactive interfaces to search, visualize, download and analyze R-loops and R-loop regulators in a well-annotated genomic context. R-loopBase allows all users, including those with little bioinformatics background to utilize these data for their own research. We anticipate R-loopBase will become a one-stop resource for the R-loop community.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Cell Line, Tumor
  • Chromosome Mapping
  • Computational Biology / methods
  • DNA / chemistry
  • DNA / genetics*
  • DNA / metabolism
  • Databases, Nucleic Acid
  • Datasets as Topic
  • Gene Regulatory Networks
  • Genome*
  • Genomic Instability
  • HEK293 Cells
  • Humans
  • Internet
  • Molecular Sequence Annotation
  • Neoplasms / genetics*
  • Neoplasms / metabolism
  • Neoplasms / pathology
  • Nervous System Diseases / genetics*
  • Nervous System Diseases / metabolism
  • Nervous System Diseases / pathology
  • Protein Interaction Mapping / methods
  • R-Loop Structures*
  • RNA / chemistry
  • RNA / genetics*
  • RNA / metabolism
  • Software*
  • Transcription, Genetic

Substances

  • RNA
  • DNA