Mining for missed sORF-encoded peptides

Expert Rev Proteomics. 2019 Mar;16(3):257-266. doi: 10.1080/14789450.2019.1571919. Epub 2019 Feb 13.

Abstract

Small open reading frames (sORFs) with potential protein-coding capacity have been disclosed in various transcripts, including long noncoding RNAs (LncRNAs), mRNAs (5'-upstream, coding domain, and 3'-downstream), circular RNAs, pri-miRNAs, and ribosomal RNAs (rRNAs). Recent characterization of several sORF-encoded peptides (SEPs or micropeptides) revealed their important roles in many fundamental biological processes in a broad range of species from yeast to human. The success in the mining of micropeptides attributes to the advanced bioinformatics and high-throughput sequencing techniques. Areas covered: sORFs and SEPs were overlooked for their tiny size and the difficulty of identification by bioinformatics analyses. With more and more sORFs and SEPs have been identified, this field has attracted more attention. This review covers recent advances in the strategies for the detection and identification of sORFs and SEPs. Expert commentary: The advantages and drawbacks of the strategies for detection and identification of sORFs and SEPs are discussed, as well as the techniques that are used to decipher the roles of micropeptides in organisms are described.

Keywords: bioinformatics; micropeptides; ribosome profiling.

Publication types

  • Research Support, Non-U.S. Gov't
  • Review

MeSH terms

  • Computational Biology
  • Humans
  • Open Reading Frames / genetics*
  • Peptides / genetics*
  • Proteomics*
  • RNA, Circular / genetics
  • RNA, Long Noncoding / genetics
  • RNA, Messenger / genetics
  • RNA, Ribosomal / genetics

Substances

  • Peptides
  • RNA, Circular
  • RNA, Long Noncoding
  • RNA, Messenger
  • RNA, Ribosomal