Intragene higher order repeats in neuroblastoma breakpoint family genes distinguish humans from chimpanzees

Mol Biol Evol. 2011 Jun;28(6):1877-92. doi: 10.1093/molbev/msr009. Epub 2011 Jan 27.

Abstract

Much attention has been devoted to identifying genomic patterns underlying the evolution of the human brain and its emergent advanced cognitive capabilities, which lie at the heart of differences distinguishing humans from chimpanzees, our closest living relatives. Here, we identify two particular intragene repeat structures of noncoding human DNA, spanning as much as a hundred kilobases, that are present in human genome but are absent from the chimpanzee genome and other nonhuman primates. Using our novel computational method Global Repeat Map, we examine tandem repeat structure in human and chimpanzee chromosome 1. In human chromosome 1, we find three higher order repeats (HORs), two of them novel, not reported previously, whereas in chimpanzee chromosome 1, we find only one HOR, a 2mer alphoid HOR instead of human alphoid 11mer HOR. In human chromosome 1, we identify an HOR based on 39-bp primary repeat unit, with secondary, tertiary, and quartic repeat units, fully embedded in human hornerin gene, related to regenerating and psoriatric skin. Such an HOR is not found in chimpanzee chromosome 1. We find a remarkable human 3mer HOR organization based on the ~1.6-kb primary repeat unit, fully embedded within the neuroblastoma breakpoint family genes, which is related to the function of the human brain. Such HORs are not present in chimpanzees. In general, we find that human-chimpanzee differences are much larger for tandem repeats, in particularly for HORs, than for gene sequences. This may be of great significance in light of recent studies that are beginning to reveal the large-scale regulatory architecture of the human genome, in particular the role of noncoding sequences. We hypothesize about the possible importance of human accelerated HOR patterns as components in the gene expression multilayered regulatory network.

MeSH terms

  • Alu Elements / genetics
  • Animals
  • Base Sequence
  • Calcium-Binding Proteins / genetics
  • Carrier Proteins / genetics
  • Chromosome Mapping
  • Chromosomes, Human, Pair 1 / genetics
  • Evolution, Molecular
  • Gene Order
  • Gene Regulatory Networks / genetics
  • Genome, Human / genetics
  • Humans
  • Intermediate Filament Proteins / genetics
  • Microsatellite Repeats / genetics
  • Molecular Sequence Data
  • Neoplasm Proteins / genetics*
  • Pan troglodytes / genetics*
  • Sequence Alignment
  • Species Specificity
  • Tandem Repeat Sequences / genetics*

Substances

  • Calcium-Binding Proteins
  • Carrier Proteins
  • HRNR protein, human
  • Intermediate Filament Proteins
  • Neoplasm Proteins