Noncoding sequences near duplicated genes evolve rapidly

Genome Biol Evol. 2010;2:518-33. doi: 10.1093/gbe/evq037. Epub 2010 Jun 29.

Abstract

Gene expression divergence and chromosomal rearrangements have been put forward as major contributors to phenotypic differences between closely related species. It has also been established that duplicated genes show enhanced rates of positive selection in their amino acid sequences. If functional divergence is largely due to changes in gene expression, it follows that regulatory sequences in duplicated loci should also evolve rapidly. To investigate this hypothesis, we performed likelihood ratio tests (LRTs) on all noncoding loci within 5 kb of every transcript in the human genome and identified sequences with increased substitution rates in the human lineage since divergence from Old World monkeys. The fraction of rapidly evolving loci is significantly higher near genes that duplicated in the common ancestor of humans and chimps than near nonduplicated genes. We also conducted a genome-wide scan for nucleotide substitutions predicted to affect transcription factor binding. Rates of binding site divergence are elevated in noncoding sequences of duplicated loci with accelerated substitution rates. Many of the genes associated with these fast-evolving genomic elements belong to functional categories identified in previous studies of positive selection on amino acid sequences. In addition, we find enrichment for accelerated evolution near genes involved in the establishment and maintenance of pregnancy, processes that differ significantly between humans and monkeys. Our findings support the hypothesis that adaptive evolution of the regulation of duplicated genes has played a significant role in human evolution.
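The likelihood ratio test mentioned above can be illustrated with a deliberately simplified sketch. The abstract does not specify the substitution model or the null hypothesis used in the study, so the version below is an assumption-laden stand-in: it treats the substitutions at a locus as binomial draws and asks whether the observed per-site rate exceeds an expected rate. The function names (`binom_loglik`, `acceleration_lrt`) and the parameters `k`, `n`, and `p0` are hypothetical and do not come from the paper.

```python
import math


def binom_loglik(k, n, p):
    """Binomial log-likelihood of k substitutions in n sites (constant term omitted)."""
    if p <= 0.0:
        return 0.0 if k == 0 else -math.inf
    if p >= 1.0:
        return 0.0 if k == n else -math.inf
    return k * math.log(p) + (n - k) * math.log(1.0 - p)


def acceleration_lrt(k, n, p0):
    """One-sided LRT of H0: rate == p0 against H1: rate > p0.

    k  -- substitutions observed at the locus (hypothetical input)
    n  -- aligned sites at the locus (hypothetical input)
    p0 -- expected per-site substitution probability under the null (assumed known)
    Returns (statistic, p_value).
    """
    if n <= 0:
        raise ValueError("n must be positive")
    p_hat = k / n  # maximum-likelihood estimate under the alternative
    if p_hat <= p0:
        # No evidence of acceleration: the constrained MLE equals p0.
        return 0.0, 1.0
    stat = 2.0 * (binom_loglik(k, n, p_hat) - binom_loglik(k, n, p0))
    # Chi-square survival function with 1 degree of freedom is erfc(sqrt(x / 2));
    # it is halved because the alternative is one-sided.
    p_value = 0.5 * math.erfc(math.sqrt(stat / 2.0))
    return stat, p_value


if __name__ == "__main__":
    # Illustrative numbers only, not data from the study.
    stat, p = acceleration_lrt(k=30, n=1000, p0=0.01)
    print(f"LRT statistic = {stat:.2f}, one-sided p = {p:.2e}")
```

A genome-wide scan of this kind would apply such a test to every locus and then correct for multiple testing; a phylogenetic model that accounts for branch lengths and rate variation would replace the binomial likelihood in a realistic analysis.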

Publication types

  • Comparative Study
  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • 5' Untranslated Regions
  • Animals
  • Binding Sites / genetics
  • Cercopithecidae / genetics
  • Evolution, Molecular*
  • Exons
  • Female
  • Gene Duplication / genetics*
  • Genome, Human*
  • Genome-Wide Association Study
  • Humans
  • Macaca / genetics
  • Models, Genetic
  • Pan troglodytes / genetics
  • Pregnancy
  • Pregnancy Maintenance / genetics
  • RNA, Untranslated / genetics*
  • Sequence Alignment
  • Species Specificity
  • Transcription Factors / metabolism

Substances

  • 5' Untranslated Regions
  • RNA, Untranslated
  • Transcription Factors