Gene Ontology-based function prediction of long non-coding RNAs using bi-random walk

BMC Med Genomics. 2018 Nov 20;11(Suppl 5):99. doi: 10.1186/s12920-018-0414-2.

Abstract

Background: With the development of sequencing technology, more and more long non-coding RNAs (lncRNAs) have been identified. Some lncRNAs have been confirmed that they play an important role in the process of development through the dosage compensation effect, epigenetic regulation, cell differentiation regulation and other aspects. However, the majority of the lncRNAs have not been functionally characterized. Explore the function of lncRNAs and the regulatory network has become a hot research topic currently.

Methods: In the work, a network-based model named BiRWLGO is developed. The ultimate goal is to predict the probable functions for lncRNAs at large scale. The new model starts with building a global network composed of three networks: lncRNA similarity network, lncRNA-protein association network and protein-protein interaction (PPI) network. After that, it utilizes bi-random walk algorithm to explore the similarities between lncRNAs and proteins. Finally, we can annotate an lncRNA with the Gene Ontology (GO) terms according to its neighboring proteins.

Results: We compare the performance of BiRWLGO with the state-of-the-art models on a manually annotated lncRNA benchmark with known GO terms. The experimental results assert that BiRWLGO outperforms other methods in terms of both maximum F-measure (Fmax) and coverage.

Conclusions: BiRWLGO is a relatively efficient method to predict the functions of lncRNA. When protein interaction data is integrated, the predictive performance of BiRWLGO gains a great improvement.

Keywords: Bi-random; Function annotation; lncRNA.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Computational Biology / methods
  • Gene Ontology*
  • Gene Regulatory Networks
  • Humans
  • MicroRNAs / genetics
  • MicroRNAs / metabolism
  • Protein Interaction Maps / genetics
  • Proteins / metabolism
  • RNA, Long Noncoding / genetics
  • RNA, Long Noncoding / metabolism*
  • Stomach Neoplasms / genetics
  • Stomach Neoplasms / pathology

Substances

  • GHET1 transcript, human
  • MicroRNAs
  • Proteins
  • RNA, Long Noncoding
  • long non-coding RNA HOTAIRM1, human