Biological function of unannotated transcription during the early development of Drosophila melanogaster

Nat Genet. 2006 Oct;38(10):1151-8. doi: 10.1038/ng1875. Epub 2006 Sep 3.

Abstract

Many animal and plant genomes are transcribed much more extensively than current annotations predict. However, the biological function of these unannotated transcribed regions is largely unknown. Approximately 7% and 23% of the detected transcribed nucleotides during D. melanogaster embryogenesis map to unannotated intergenic and intronic regions, respectively. Based on computational analysis of coordinated transcription, we conservatively estimate that 29% of all unannotated transcribed sequences function as missed or alternative exons of well-characterized protein-coding genes. We estimate that 15.6% of intergenic transcribed regions function as missed or alternative transcription start sites (TSS) used by 11.4% of the expressed protein-coding genes. Identification of P element mutations within or near newly identified 5' exons provides a strategy for mapping previously uncharacterized mutations to their respective genes. Collectively, these data indicate that at least 85% of the fly genome is transcribed and processed into mature transcripts representing at least 30% of the fly genome.
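The percentages above depend on assigning each detected transcribed nucleotide to an annotated exon, an intron, or an intergenic region. The following is a minimal sketch of that bookkeeping, not the authors' pipeline. It assumes half-open coordinates on a single strand of one chromosome, non-overlapping genes, and exons that lie entirely inside genes. The names `classify_nucleotides`, `genes`, `exons`, and `fragments`, and all coordinates, are hypothetical toy values.

```python
from typing import Dict, List, Tuple

Interval = Tuple[int, int]  # half-open [start, end), hypothetical coordinates


def overlap(a: Interval, b: Interval) -> int:
    """Number of positions shared by two half-open intervals."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]))


def classify_nucleotides(
    fragments: List[Interval],
    exons: List[Interval],
    genes: List[Interval],
) -> Dict[str, int]:
    """Count transcribed nucleotides falling in exons, introns, or between genes.

    Assumes genes do not overlap each other, exons do not overlap each other,
    and every exon lies within a gene (a simplification of real annotations).
    """
    counts = {"exonic": 0, "intronic": 0, "intergenic": 0}
    for frag in fragments:
        length = frag[1] - frag[0]
        exonic = sum(overlap(frag, e) for e in exons)
        genic = sum(overlap(frag, g) for g in genes)
        counts["exonic"] += exonic
        counts["intronic"] += genic - exonic      # inside a gene, outside its exons
        counts["intergenic"] += length - genic    # outside every annotated gene
    return counts


# Toy annotation: one gene with two exons, and three transcribed fragments.
genes = [(100, 500)]
exons = [(100, 200), (400, 500)]
fragments = [(150, 250), (600, 700), (450, 500)]

counts = classify_nucleotides(fragments, exons, genes)
total = sum(counts.values())
fractions = {k: v / total for k, v in counts.items()}
```

With real data, the intronic and intergenic fractions would correspond to the "unannotated" categories described in the abstract. Handling overlapping genes, both strands, and multiple chromosomes would need an interval-merging step that this sketch omits.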

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Animals
  • DNA, Intergenic
  • Drosophila Proteins / genetics
  • Drosophila melanogaster / embryology*
  • Drosophila melanogaster / genetics*
  • Embryo, Nonmammalian
  • Exons
  • Gene Expression Regulation, Developmental*
  • Genome, Insect
  • Molecular Sequence Data
  • Mutation
  • Oligonucleotide Array Sequence Analysis
  • Transcription Initiation Site
  • Transcription, Genetic*

Substances

  • DNA, Intergenic
  • Drosophila Proteins

Associated data

  • GENBANK/DQ327735
  • GENBANK/DQ327736
  • GENBANK/DQ327737
  • GENBANK/DQ327738
  • GENBANK/DQ327739
  • GENBANK/DQ327740
  • GENBANK/DQ327741
  • GENBANK/DQ327742
  • GENBANK/DQ327743
  • GENBANK/DQ327744
  • GENBANK/DQ327745
  • GENBANK/DQ327746
  • GENBANK/DQ327747
  • GENBANK/DQ327748
  • GENBANK/DQ327749
  • GENBANK/DQ327750
  • GENBANK/DQ327751
  • GENBANK/DQ327752
  • GENBANK/DQ327753
  • GENBANK/DQ327754
  • GENBANK/DQ327755
  • GENBANK/DQ327756
  • GENBANK/DQ327757
  • GENBANK/DQ327758
  • GENBANK/DQ327759
  • GENBANK/DQ327760
  • GENBANK/DQ327761
  • GENBANK/DQ327762
  • GENBANK/DQ327763
  • GENBANK/DQ327764
  • GENBANK/DQ327765
  • GENBANK/DQ327766
  • GENBANK/DQ327767
  • GENBANK/DQ327768
  • GENBANK/DQ327769
  • GENBANK/DQ327770
  • GENBANK/DQ327771
  • GENBANK/DQ327772
  • GENBANK/DQ327773
  • GENBANK/DQ327774
  • GENBANK/DQ327775
  • GENBANK/DQ327776
  • GENBANK/DQ327777
  • GENBANK/DQ327778
  • GENBANK/DQ327779
  • GENBANK/DQ327780
  • GENBANK/DQ327781
  • GENBANK/DQ327782
  • GENBANK/DQ327783
  • GENBANK/DQ327784
  • GENBANK/DQ327785
  • GENBANK/DQ327786
  • GENBANK/DQ327787
  • GENBANK/DQ327788
  • GENBANK/DQ327789
  • GENBANK/DQ327790
  • GENBANK/DQ327791
  • GENBANK/DQ327792
  • GENBANK/DQ327793
  • GENBANK/DQ327794
  • GENBANK/DQ327795
  • GENBANK/DQ327796
  • GENBANK/DQ327797
  • GENBANK/DQ327798
  • GENBANK/DQ327799
  • GENBANK/DQ327800
  • GENBANK/DQ327801
  • GENBANK/DQ327802
  • GENBANK/DQ327803
  • GENBANK/DQ327804
  • GENBANK/DQ327805
  • GENBANK/DQ327806
  • GENBANK/DQ327807
  • GEO/GSE5514