The extent of linkage disequilibrium and computational challenges of single nucleotide polymorphisms in genome-wide association studies

Curr Drug Metab. 2011 Jun;12(5):498-506. doi: 10.2174/138920011795495312.

Abstract

Single Nucleotide Polymorphisms (SNPs) are the most abundant form of genetic variations observed in the human genome. With the advent of high-throughput genotyping arrays and next-generation sequencing (NGS) platforms, tens of millions of SNPs have been uncovered in several human populations. However, the huge amount of SNPs bring new challenges in subsequent analysis. In reality, a number of SNPs may not be genotyped, and non-mutant bases may be falsely reported as SNPs in the microarray and NGS platforms. Furthermore, the identification of disease susceptibility genes are often confounded by numerous SNPs correlated by chance. In this paper, we review existing approaches for calling SNPs using microarrays and next-generation sequencing (NGS) platforms. Methods for measuring linkage disequilibrium (LD) and applications of the LD structure are discussed. Finally, we compare methods for inferring haplotypes from genotypes using microarray and NGS platforms and present the challenges of using SNPs in large-scale association studies.

Publication types

  • Research Support, Non-U.S. Gov't
  • Review

MeSH terms

  • Genome-Wide Association Study / methods*
  • Genome-Wide Association Study / statistics & numerical data*
  • Genotype
  • Haplotypes / genetics
  • Humans
  • Linkage Disequilibrium / genetics*
  • Models, Statistical*
  • Oligonucleotide Array Sequence Analysis / methods*
  • Oligonucleotide Array Sequence Analysis / statistics & numerical data*
  • Polymorphism, Single Nucleotide / genetics*