An integrated approach for analyzing clinical genomic variant data from next-generation sequencing

J Biomol Tech. 2015 Apr;26(1):19-28. doi: 10.7171/jbt.15-2601-002.

Abstract

Next-generation sequencing (NGS) technologies provide the potential for developing high-throughput and low-cost platforms for clinical diagnostics. A limiting factor to clinical applications of genomic NGS is downstream bioinformatics analysis for data interpretation. We have developed an integrated approach for end-to-end clinical NGS data analysis from variant detection to functional profiling. Robust bioinformatics pipelines were implemented for genome alignment, single nucleotide polymorphism (SNP), small insertion/deletion (InDel), and copy number variation (CNV) detection of whole exome sequencing (WES) data from the Illumina platform. Quality-control metrics were analyzed at each step of the pipeline by use of a validated training dataset to ensure data integrity for clinical applications. We annotate the variants with data regarding the disease population and variant impact. Custom algorithms were developed to filter variants based on criteria, such as quality of variant, inheritance pattern, and impact of variant on protein function. The developed clinical variant pipeline links the identified rare variants to Integrated Genome Viewer for visualization in a genomic context and to the Protein Information Resource's iProXpress for rich protein and disease information. With the application of our system of annotations, prioritizations, inheritance filters, and functional profiling and analysis, we have created a unique methodology for downstream variant filtering that empowers clinicians and researchers to interpret more effectively the relevance of genomic alterations within a rare genetic disease.

Keywords: Mendelian Genetics; bioinformatics; genetic alterations; protein information resources.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Craniofacial Abnormalities / genetics
  • DNA Copy Number Variations
  • Female
  • Gene Ontology
  • Genetic Association Studies*
  • Genomics
  • High-Throughput Nucleotide Sequencing*
  • Humans
  • INDEL Mutation
  • Male
  • Pedigree
  • Polymorphism, Single Nucleotide
  • Sequence Analysis, DNA*