From single-SNP to wide-locus: genome-wide association studies identifying functionally related genes and intragenic regions in small sample studies

Pharmacogenomics. 2013 Mar;14(4):391-401. doi: 10.2217/pgs.13.28.

Abstract

Background: Genome-wide association studies (GWAS) have had limited success when applied to complex diseases. When SNPs are analyzed individually, several large studies are needed and their often divergent results must then be integrated. In the presence of epistasis, multivariate approaches based on the linear model (including stepwise logistic regression) often have low sensitivity and generate an abundance of artifacts.

Methods: Recent advances in distributed and parallel processing have spurred methodological progress in nonparametric statistics. U-statistics for structured multivariate data (µStat) are not confounded by unrealistic assumptions (e.g., linearity, independence).
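
The abstract does not spell out how µStat scores are computed. As a rough illustration only, the following Python sketch shows one common way to build u-scores for multivariate data from a partial ordering: a subject counts as superior to another if it is at least as high on every variable and strictly higher on at least one, and each score is the number of inferior subjects minus the number of superior ones. The function names `pairwise_order` and `mu_scores` and the toy profiles are hypothetical, and the sketch ignores the "structured" part of the method (knowledge about relationships between variables), so it should not be read as the authors' implementation.

```python
from typing import List, Sequence


def pairwise_order(a: Sequence[float], b: Sequence[float]) -> int:
    """Compare two profiles under a componentwise partial ordering.

    Returns 1 if a is superior to b, -1 if b is superior to a,
    and 0 if the profiles are identical or cannot be ordered.
    """
    a_at_least_b = all(x >= y for x, y in zip(a, b))
    b_at_least_a = all(y >= x for x, y in zip(a, b))
    if a_at_least_b and not b_at_least_a:
        return 1
    if b_at_least_a and not a_at_least_b:
        return -1
    return 0


def mu_scores(profiles: Sequence[Sequence[float]]) -> List[int]:
    """Score each profile as (# inferior profiles) - (# superior profiles).

    Assumption: this is a generic u-score construction, not necessarily
    the exact procedure used in the paper.
    """
    return [sum(pairwise_order(p, q) for q in profiles) for p in profiles]


# Hypothetical toy data: four subjects, two variables each.
toy_profiles = [(0, 0), (1, 0), (0, 1), (2, 2)]
print(mu_scores(toy_profiles))  # [-3, 0, 0, 3]
```

Because (1, 0) and (0, 1) cannot be ordered against each other, neither contributes to the other's score. No weighting or linear combination of the variables is assumed, which is the sense in which such scores avoid a linearity assumption.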

Results: By incorporating knowledge about relationships between SNPs, µGWAS (GWAS based on µStat) can identify clusters of genes around biologically relevant pathways and pinpoint functionally relevant regions within these genes.

Conclusion: This computational biostatistics approach increases power and guards against artifacts. With it, personalized medicine and comparative effectiveness will advance, and subgroup analyses of Phase III trials can now suggest risk factors for adverse events and novel directions for drug development.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Clinical Trials, Phase III as Topic
  • Epilepsy / genetics*
  • Epistasis, Genetic
  • Genome-Wide Association Study / statistics & numerical data*
  • Humans
  • Metabolic Networks and Pathways / genetics*
  • Polymorphism, Single Nucleotide / genetics
  • Statistics, Nonparametric*
  • ras Proteins / genetics

Substances

  • ras Proteins