Design and coverage of high throughput genotyping arrays optimized for individuals of East Asian, African American, and Latino race/ethnicity using imputation and a novel hybrid SNP selection algorithm

Genomics. 2011 Dec;98(6):422-30. doi: 10.1016/j.ygeno.2011.08.007. Epub 2011 Aug 28.

Abstract

Four custom Axiom genotyping arrays were designed for a genome-wide association (GWA) study of 100,000 participants from the Kaiser Permanente Research Program on Genes, Environment and Health. The array optimized for individuals of European race/ethnicity was previously described. Here we detail the development of three additional microarrays optimized for individuals of East Asian, African American, and Latino race/ethnicity. For these arrays, we decreased redundancy of high-performing SNPs to increase SNP capacity. The East Asian array was designed using greedy pairwise SNP selection. However, removing SNPs from the target set based on imputation coverage is more efficient than pairwise tagging. Therefore, we developed a novel hybrid SNP selection method for the African American and Latino arrays utilizing rounds of greedy pairwise SNP selection, followed by removal from the target set of SNPs covered by imputation. The arrays provide excellent genome-wide coverage and are valuable additions for large-scale GWA studies.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Asia, Eastern
  • Asian People / genetics*
  • Black or African American / genetics*
  • Genome, Human
  • Genome-Wide Association Study / methods*
  • Genotype
  • Hispanic or Latino / genetics*
  • Humans
  • Oligonucleotide Array Sequence Analysis / methods
  • Pilot Projects
  • Polymorphism, Single Nucleotide*
  • White People / genetics