Impact of Pre-Analytical Variables on Cancer Targeted Gene Sequencing Efficiency

PLoS One. 2015 Nov 25;10(11):e0143092. doi: 10.1371/journal.pone.0143092. eCollection 2015.

Abstract

Tumor specimens are often preserved as formalin-fixed paraffin-embedded (FFPE) tissue blocks, the most common clinical source for DNA sequencing. Herein, we evaluated the effect of pre-sequencing parameters to guide proper sample selection for targeted gene sequencing. Data from 113 FFPE lung tumor specimens were collected, and targeted gene sequencing was performed. Libraries were constructed using custom probes and were paired-end sequenced on a next generation sequencing platform. A PCR-based quality control (QC) assay was utilized to determine DNA quality, and a ratio was generated in comparison to control DNA. We observed that FFPE storage time, PCR/QC ratio, and DNA input in the library preparation were significantly correlated to most parameters of sequencing efficiency including depth of coverage, alignment rate, insert size, and read quality. A combined score using the three parameters was generated and proved highly accurate to predict sequencing metrics. We also showed wide read count variability within the genome, with worse coverage in regions of low GC content like in KRAS. Sample quality and GC content had independent effects on sequencing depth, and the worst results were observed in regions of low GC content in samples with poor quality. Our data confirm that FFPE samples are a reliable source for targeted gene sequencing in cancer, provided adequate sample quality controls are exercised. Tissue quality should be routinely assessed for pre-analytical factors, and sequencing depth may be limited in genomic regions of low GC content if suboptimal samples are utilized.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Biomarkers, Tumor
  • DNA Mutational Analysis / methods
  • DNA Mutational Analysis / standards
  • Humans
  • Lung Neoplasms / diagnosis
  • Lung Neoplasms / genetics
  • Mutation
  • Neoplasms / diagnosis
  • Neoplasms / genetics*
  • Quality Control
  • Reproducibility of Results
  • Sequence Analysis, DNA / methods*
  • Sequence Analysis, DNA / standards*

Substances

  • Biomarkers, Tumor