A comprehensive quality control workflow for paired tumor-normal NGS experiments
- PMID: 28130233
- DOI: 10.1093/bioinformatics/btx032
A comprehensive quality control workflow for paired tumor-normal NGS experiments
Abstract
Summary: Quality control (QC) is an important part of all NGS data analysis stages. Many available tools calculate QC metrics from different analysis steps of single sample experiments (raw reads, mapped reads and variant lists). Multi-sample experiments, as sequencing of tumor-normal pairs, require additional QC metrics to ensure validity of results. These multi-sample QC metrics still lack standardization. We therefore suggest a new workflow for QC of DNA sequencing of tumor-normal pairs. With this workflow well-known single-sample QC metrics and additional metrics specific for tumor-normal pairs can be calculated. The segmentation into different tools offers a high flexibility and allows reuse for other purposes. All tools produce qcML, a generic XML format for QC of -omics experiments. qcML uses quality metrics defined in an ontology, which was adapted for NGS.
Availability and implementation: All QC tools are implemented in C ++ and run both under Linux and Windows. Plotting requires python 2.7 and matplotlib. The software is available under the 'GNU General Public License version 2' as part of the ngs-bits project: https://github.com/imgag/ngs-bits.
Contact: christopher.schroeder@med.uni-tuebingen.de.
Supplementary information: Supplementary data are available at Bioinformatics online.
© The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com
Similar articles
-
ChronQC: a quality control monitoring system for clinical next generation sequencing.Bioinformatics. 2018 May 15;34(10):1799-1800. doi: 10.1093/bioinformatics/btx843. Bioinformatics. 2018. PMID: 29300845
-
KAT: a K-mer analysis toolkit to quality control NGS datasets and genome assemblies.Bioinformatics. 2017 Feb 15;33(4):574-576. doi: 10.1093/bioinformatics/btw663. Bioinformatics. 2017. PMID: 27797770 Free PMC article.
-
QC-Chain: fast and holistic quality control method for next-generation sequencing data.PLoS One. 2013;8(4):e60234. doi: 10.1371/journal.pone.0060234. Epub 2013 Apr 2. PLoS One. 2013. PMID: 23565205 Free PMC article.
-
Comprehensive fundamental somatic variant calling and quality management strategies for human cancer genomes.Brief Bioinform. 2021 May 20;22(3):bbaa083. doi: 10.1093/bib/bbaa083. Brief Bioinform. 2021. PMID: 32510555 Review.
-
Prevention, diagnosis and treatment of high-throughput sequencing data pathologies.Mol Ecol. 2014 Apr;23(7):1679-700. doi: 10.1111/mec.12680. Epub 2014 Mar 13. Mol Ecol. 2014. PMID: 24471475 Review.
Cited by
-
Enrichment of SARM1 alleles encoding variants with constitutively hyperactive NADase in patients with ALS and other motor nerve disorders.Elife. 2021 Nov 19;10:e70905. doi: 10.7554/eLife.70905. Elife. 2021. PMID: 34796871 Free PMC article.
-
Cancer proteogenomics: current impact and future prospects.Nat Rev Cancer. 2022 May;22(5):298-313. doi: 10.1038/s41568-022-00446-5. Epub 2022 Mar 2. Nat Rev Cancer. 2022. PMID: 35236940 Review.
-
Statistical guidelines for quality control of next-generation sequencing techniques.Life Sci Alliance. 2021 Aug 30;4(11):e202101113. doi: 10.26508/lsa.202101113. Print 2021 Nov. Life Sci Alliance. 2021. PMID: 34462322 Free PMC article.
-
Clonal evolution patterns in acute myeloid leukemia with NPM1 mutation.Nat Commun. 2019 May 2;10(1):2031. doi: 10.1038/s41467-019-09745-2. Nat Commun. 2019. PMID: 31048683 Free PMC article.
-
Cancer immune control needs senescence induction by interferon-dependent cell cycle regulator pathways in tumours.Nat Commun. 2020 Mar 12;11(1):1335. doi: 10.1038/s41467-020-14987-6. Nat Commun. 2020. PMID: 32165639 Free PMC article.
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
