Computational analysis of full-length cDNAs reveals frequent coupling between transcriptional and splicing programs

DNA Res. 2008 Apr 30;15(2):63-72. doi: 10.1093/dnares/dsm036. Epub 2008 Feb 14.

Abstract

High-throughput sequencing studies revealed that the majority of human and mouse multi-exon genes have multiple splice forms. High-density oligonucleotide array-based measurements have further established that many exons are expressed in a tissue-specific manner. The mechanisms underlying the tissue-dependent expression of most alternative exons remain, however, to be understood. In this study, we focus on one possible mechanism, namely the coupling of (tissue specific) transcription regulation with alternative splicing. We analyzed the FANTOM3 and H-Invitational datasets of full-length mouse and human cDNAs, respectively, and found that in transcription units with multiple start sites, the inclusion of at least 15% and possibly up to 30% of the 'cassette' exons correlates with the use of specific transcription start sites (TSS). The vast majority of TSS-associated exons are conserved between human and mouse, yet the conservation is weaker when compared with TSS-independent exons. Additionally, the currently available data only support a weak correlation between the probabilities of TSS association of orthologous exons. Our analysis thus suggests frequent coupling of transcriptional and splicing programs, and provides a large dataset of exons on which the molecular basis of this coupling can be further studied.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Alternative Splicing*
  • Animals
  • Computational Biology / methods*
  • DNA, Complementary / genetics*
  • Exons / genetics
  • Gene Expression Regulation*
  • Humans
  • Mice
  • RNA, Messenger / genetics
  • Signal Transduction
  • Transcription, Genetic*

Substances

  • DNA, Complementary
  • RNA, Messenger