Functional annotation of the vlinc class of non-coding RNAs using systems biology approach

Nucleic Acids Res. 2016 Apr 20;44(7):3233-52. doi: 10.1093/nar/gkw162. Epub 2016 Mar 21.

Abstract

Determining the functionality of the non-coding transcripts encoded by the human genome is a coveted goal of modern genomics research. While this effort has commonly relied on the classical methods of forward genetics, integrating different genomics datasets in a global Systems Biology fashion presents a more productive avenue for achieving this very complex aim. Here we report the application of a Systems Biology-based approach to dissect the functionality of a newly identified, vast class of very long intergenic non-coding (vlinc) RNAs. Using the highly quantitative FANTOM5 CAGE dataset, we show that these RNAs could be grouped into 1542 novel human genes based on analysis of insulators, which we show here indeed function as genomic barrier elements. We show that vlinc RNA genes likely function in cis to activate nearby genes. This effect, while most pronounced in closely spaced vlinc RNA-gene pairs, can be detected over relatively large genomic distances. Furthermore, we identified 101 vlinc RNA genes likely involved in early embryogenesis based on patterns of their expression and regulation. We also found another 109 such genes potentially involved in cellular functions that also occur at early stages of development, such as proliferation, migration and apoptosis. Overall, we show that Systems Biology-based methods have great promise for functional annotation of non-coding RNAs.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Cell Nucleus / genetics
  • Embryonic Development / genetics
  • Gene Expression Regulation
  • Humans
  • Insulator Elements
  • Molecular Sequence Annotation
  • Promoter Regions, Genetic
  • RNA, Long Noncoding / classification
  • RNA, Long Noncoding / genetics*
  • RNA, Long Noncoding / metabolism
  • Retroviridae / genetics
  • Systems Biology
  • Terminal Repeat Sequences
  • Transcription Factors / metabolism

Substances

  • RNA, Long Noncoding
  • Transcription Factors