Summary: We present MetaMarker, a pipeline for discovering metagenomic biomarkers from whole-metagenome sequencing samples. Different from existing methods, MetaMarker is based on a de novo approach that does not require mapping raw reads to a reference database. We applied MetaMarker on whole-metagenome sequencing of colorectal cancer (CRC) stool samples from France to discover CRC specific metagenomic biomarkers. We showed robustness of the discovered biomarkers by validating in independent samples from Hong Kong, Austria, Germany and Denmark. We further demonstrated these biomarkers could be used to build a machine learning classifier for CRC prediction.
Availability and implementation: MetaMarker is freely available at https://bitbucket.org/mkoohim/metamarker under GPLv3 license.
Supplementary information: Supplementary data are available at Bioinformatics online.
© The Author(s) 2019. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: email@example.com.
Metagenomic analysis of faecal microbiome as a tool towards targeted non-invasive biomarkers for colorectal cancer.Gut. 2017 Jan;66(1):70-78. doi: 10.1136/gutjnl-2015-309800. Epub 2015 Sep 25. Gut. 2017. PMID: 26408641
Alterations in Enteric Virome Are Associated With Colorectal Cancer and Survival Outcomes.Gastroenterology. 2018 Aug;155(2):529-541.e5. doi: 10.1053/j.gastro.2018.04.018. Epub 2018 Apr 22. Gastroenterology. 2018. PMID: 29689266
MOCAT2: a metagenomic assembly, annotation and profiling framework.Bioinformatics. 2016 Aug 15;32(16):2520-3. doi: 10.1093/bioinformatics/btw183. Epub 2016 Apr 8. Bioinformatics. 2016. PMID: 27153620 Free PMC article.
NGSPanPipe: A Pipeline for Pan-genome Identification in Microbial Strains from Experimental Reads.Adv Exp Med Biol. 2018;1052:39-49. doi: 10.1007/978-981-10-7572-8_4. Adv Exp Med Biol. 2018. PMID: 29785479 Review.
Bioinformatics tools for quantitative and functional metagenome and metatranscriptome data analysis in microbes.Brief Bioinform. 2018 Nov 27;19(6):1415-1429. doi: 10.1093/bib/bbx051. Brief Bioinform. 2018. PMID: 28481971 Review.