Analysis of high-order SNP barcodes in mitochondrial D-loop for chronic dialysis susceptibility

J Biomed Inform. 2016 Oct:63:112-119. doi: 10.1016/j.jbi.2016.08.009. Epub 2016 Aug 6.

Abstract

Objectives: Positively identifying disease-associated single nucleotide polymorphism (SNP) markers in genome-wide studies entails the complex association analysis of a huge number of SNPs. Such large numbers of SNP barcode (SNP/genotype combinations) continue to pose serious computational challenges, especially for high-dimensional data.

Methods: We propose a novel exploiting SNP barcode method based on differential evolution, termed IDE (improved differential evolution). IDE uses a "top combination strategy" to improve the ability of differential evolution to explore high-order SNP barcodes in high-dimensional data.

Results: We simulate disease data and use real chronic dialysis data to test four global optimization algorithms. In 48 simulated disease models, we show that IDE outperforms existing global optimization algorithms in terms of exploring ability and power to detect the specific SNP/genotype combinations with a maximum difference between cases and controls. In real data, we show that IDE can be used to evaluate the relative effects of each individual SNP on disease susceptibility.

Conclusion: IDE generated significant SNP barcode with less computational complexity than the other algorithms, making IDE ideally suited for analysis of high-order SNP barcodes.

Keywords: Differential evolution; SNP barcode; Single nucleotide polymorphism.

MeSH terms

  • Algorithms*
  • DNA, Mitochondrial*
  • Electronic Data Processing*
  • Genotype
  • Humans
  • Polymorphism, Single Nucleotide*
  • Renal Dialysis / statistics & numerical data

Substances

  • DNA, Mitochondrial