High-resolution mapping and analysis of copy number variations in the human genome: a data resource for clinical and research applications

Genome Res. 2009 Sep;19(9):1682-90. doi: 10.1101/gr.083501.108. Epub 2009 Jul 10.

Abstract

We present a database of copy number variations (CNVs) detected in 2026 disease-free individuals, using high-density, SNP-based oligonucleotide microarrays. This large cohort, comprised mainly of Caucasians (65.2%) and African-Americans (34.2%), was analyzed for CNVs in a single study using a uniform array platform and computational process. We have catalogued and characterized 54,462 individual CNVs, 77.8% of which were identified in multiple unrelated individuals. These nonunique CNVs mapped to 3272 distinct regions of genomic variation spanning 5.9% of the genome; 51.5% of these were previously unreported, and >85% are rare. Our annotation and analysis confirmed and extended previously reported correlations between CNVs and several genomic features such as repetitive DNA elements, segmental duplications, and genes. We demonstrate the utility of this data set in distinguishing CNVs with pathologic significance from normal variants. Together, this analysis and annotation provides a useful resource to assist with the assessment of CNVs in the contexts of human variation, disease susceptibility, and clinical molecular diagnostics.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adult
  • African Continental Ancestry Group / genetics
  • Child
  • Chromosome Mapping / methods*
  • Databases, Genetic*
  • European Continental Ancestry Group / genetics
  • Gene Dosage / genetics*
  • Gene Duplication
  • Genetic Variation*
  • Genome, Human / genetics*
  • Humans
  • Oligonucleotide Array Sequence Analysis
  • Polymorphism, Single Nucleotide / genetics*
  • Research Design