Parallel identification of new genes in Saccharomyces cerevisiae

Genome Res. 2002 Aug;12(8):1210-20. doi: 10.1101/gr.226802.

Abstract

Short open reading frames (ORFs) occur frequently in primary genome sequence. Distinguishing bona fide small genes from the tens of thousands of short ORFs is one of the most challenging aspects of genome annotation. Direct experimental evidence is often required. Here we use a combination of expression profiling and mass spectrometry to verify the independent transcription of 138 and the translation of 50 previously nonannotated genes in the Saccharomyces cerevisiae genome. Through combined evidence, we propose the addition of 62 new genes to the genome and provide experimental support for the inclusion of 10 previously identified genes.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Amino Acid Sequence
  • Animals
  • Chromosome Mapping
  • Chromosomes, Fungal / genetics
  • Computational Biology / methods
  • Conserved Sequence / genetics
  • Conserved Sequence / physiology
  • Databases, Genetic / classification
  • Drosophila Proteins / genetics
  • Gene Expression Profiling / methods
  • Genes, Fungal / genetics*
  • Genes, Fungal / physiology
  • Genome, Fungal*
  • Mice
  • Molecular Sequence Data
  • Open Reading Frames / genetics
  • Open Reading Frames / physiology
  • Peptides / analysis
  • Peptides / physiology
  • Protein Biosynthesis / genetics
  • Protein Biosynthesis / physiology
  • Saccharomyces cerevisiae / genetics*
  • Saccharomyces cerevisiae Proteins / analysis
  • Saccharomyces cerevisiae Proteins / genetics
  • Saccharomyces cerevisiae Proteins / physiology
  • Sequence Alignment
  • Sequence Homology, Amino Acid
  • Transcription, Genetic / physiology

Substances

  • Drosophila Proteins
  • Peptides
  • Saccharomyces cerevisiae Proteins