Mining for Small Translated ORFs

J Proteome Res. 2018 Jan 5;17(1):1-11. doi: 10.1021/acs.jproteome.7b00707. Epub 2017 Dec 11.

Abstract

Peptides encoded by short open reading frames (sORFs) are usually defined as peptides ≤100 aa long. Historically, sORFs were ignored by automatic genome annotation programs because of the high probability of false discovery. However, improved computational tools, together with the high-throughput RIBO-seq approach, have identified a myriad of translated sORFs. Their importance is becoming evident as experimental validation of their diverse cellular functions accumulates. This Review examines the computational and experimental approaches to sORF identification and summarizes current knowledge of their functional roles in cells.

Keywords: RIBO-seq; coding potential; genome annotation; lncRNA; peptide; ribosome profiling; small ORF; small peptide; translation; uORF.

Publication types

  • Research Support, Non-U.S. Gov't
  • Review

MeSH terms

  • Data Mining / methods
  • Open Reading Frames / genetics*
  • Open Reading Frames / physiology
  • Peptides / genetics*
  • Peptides / physiology

Substances

  • Peptides