Endogenous retrovirus long terminal repeats as ready-to-use mobile promoters: the case of primate beta3GAL-T5

Gene. 2005 Dec 30:364:2-12. doi: 10.1016/j.gene.2005.05.045. Epub 2005 Aug 22.

Abstract

Throughout the course of vertebrate evolution, germline retroviral infections have resulted in heritable provirus insertions into host DNA. These endogenous retroviruses (ERVs) contain long terminal repeat (LTR) promoters that can be adopted for use by nearby host genes. It is not known whether the transcription factor (TF) binding sites and tissue-specificities of modern LTR gene promoters have been retained since the time of ERV insertion, or if these features evolved later as the LTR became involved in host gene regulation. To address this issue, we have conducted a case study of the ERV-L LTR promoter of human beta1,3-galactosyltransferase 5 (beta3GAL-T5). We have previously shown that the human beta3GAL-T5 LTR promoter is responsible for the majority of gene transcripts in the colon. The murine beta3gal-t5 gene is also expressed primarily in the colon, despite the absence of an orthologous ERV-L LTR in the mouse genome. We therefore hypothesized that both the ERV-L LTR and the non-retroviral ancestral beta3GAL-T5 promoter were active in the colon at the time of ERV insertion. In support of this hypothesis, we have shown that the orthologous LTRs of four non-human primates are also active in a human colorectal cell line, and that the baboon LTR is active in primary baboon colon tissue. We also present evidence that the functional TF binding sites of the human beta3GAL-T5 LTR promoter were present in the original consensus sequence for this class of LTRs. Upon similar analysis of other ERV sequences, we have concluded that this evolutionary history is shared by certain other LTR gene promoters, and may be a general phenomenon.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Base Sequence
  • Binding Sites
  • Colon / virology
  • DNA Primers
  • Endogenous Retroviruses / genetics*
  • Galactosyltransferases / genetics*
  • Humans
  • Male
  • Mice
  • Molecular Sequence Data
  • Papio / virology
  • Primates
  • Promoter Regions, Genetic*
  • Sequence Alignment
  • Sequence Homology, Nucleic Acid
  • Terminal Repeat Sequences
  • Testis / enzymology
  • Transcription Factors / metabolism
  • Transcription, Genetic
  • Viral Proteins / genetics

Substances

  • DNA Primers
  • Transcription Factors
  • Viral Proteins
  • B3GALT5 protein, human
  • Galactosyltransferases