A large and complex structural polymorphism at 16p12.1 underlies microdeletion disease risk

Nat Genet. 2010 Sep;42(9):745-50. doi: 10.1038/ng.643. Epub 2010 Aug 22.

There is a complex relationship between the evolution of segmental duplications and rearrangements associated with human disease. We performed a detailed analysis of one region on chromosome 16p12.1 associated with neurocognitive disease and identified one of the largest structural inconsistencies in the human reference assembly. Various genomic analyses show that all examined humans are homozygously inverted relative to the reference genome for a 1.1-Mb region on 16p12.1. We determined that this assembly discrepancy stems from two common structural configurations with worldwide frequencies of 17.6% (S1) and 82.4% (S2). This polymorphism arose from the rapid integration of segmental duplications, precipitating two local inversions within the human lineage over the last 10 million years. The two human haplotypes differ by 333 kb of additional duplicated sequence present in S2 but not in S1. Notably, we show that the S2 configuration harbors directly oriented duplications, specifically predisposing this chromosome to disease-associated rearrangement.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Animals
  • Cell Line, Tumor
  • Chromosome Deletion*
  • Chromosome Disorders / genetics*
  • Chromosome Mapping / standards
  • Chromosomes, Human, Pair 16* / chemistry
  • Chromosomes, Human, Pair 16* / genetics
  • Comparative Genomic Hybridization
  • Gene Dosage
  • Genetic Predisposition to Disease
  • Genetics, Population
  • Humans
  • Molecular Sequence Data
  • Oligonucleotide Array Sequence Analysis
  • Polymorphism, Genetic*
  • Primates / genetics
  • Research Design
  • Risk

Associated data

  • GENBANK/AC009124
  • GENBANK/AC120780
  • GENBANK/AC142201
  • GENBANK/AC142205
  • GENBANK/AC142206
  • GENBANK/AC145243
  • GENBANK/AC183100
  • GENBANK/AC183619
  • GENBANK/AC183674
  • GENBANK/AC183685
  • GENBANK/AC196535
  • GENBANK/AC206011
  • GENBANK/AC206441
  • GENBANK/AC207090
  • GENBANK/BK007104