Beyond the 3' end: experimental validation of extended transcript isoforms

Nucleic Acids Res. 2007;35(6):1947-57. doi: 10.1093/nar/gkm062. Epub 2007 Mar 4.

Abstract

High throughput EST and full-length cDNA sequencing have revealed extensive variations at the 3' ends of mammalian transcripts. Whether all of these changes are biologically meaningful has been the subject of controversy, as such, results may reflect in part transcription or polyadenylation leakage. We selected here a set of tandem poly(A) sites predicted from EST/cDNA sequence analysis that (i) are conserved between human and mouse, (ii) produce alternative 3' isoforms with unusual size features and (iii) are not documented in current genome databases, and we submitted these sites to experimental validation in mouse tissues. Out of 86 tested poly(A) sites from 44 genes, 84 were individually confirmed using a specially devised RT-PCR strategy. We then focused on validating the exon structure between distant tandem poly(A) sites separated by over 3 kb, and between stop codons and alternative poly(A) sites located at 4.5 kb or more, using a long-distance RT-PCR strategy. In most cases, long transcripts spanning the whole poly(A)-poly(A) or stop-poly(A) distance were detected, confirming that tandem sites were part of the same transcription unit. Given the apparent conservation of these long alternative 3' ends, different regulatory functions can be foreseen, depending on the location where transcription starts.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • 3' Untranslated Regions / chemistry*
  • Animals
  • Base Sequence
  • Cells, Cultured
  • Conserved Sequence
  • DNA, Complementary / chemistry
  • Databases, Nucleic Acid
  • Expressed Sequence Tags / chemistry
  • Humans
  • Mice
  • Mice, Inbred BALB C
  • Poly A / analysis*
  • Polyadenylation
  • Protein Isoforms / biosynthesis
  • Protein Isoforms / genetics
  • Reverse Transcriptase Polymerase Chain Reaction

Substances

  • 3' Untranslated Regions
  • DNA, Complementary
  • Protein Isoforms
  • Poly A