Unique k-mers as Strain-Specific Barcodes for Phylogenetic Analysis and Natural Microbiome Profiling
- PMID: 32023871
- PMCID: PMC7037511
- DOI: 10.3390/ijms21030944
Unique k-mers as Strain-Specific Barcodes for Phylogenetic Analysis and Natural Microbiome Profiling
Abstract
The need for a comparative analysis of natural metagenomes stimulated the development of new methods for their taxonomic profiling. Alignment-free approaches based on the search for marker k-mers turned out to be capable of identifying not only species, but also strains of microorganisms with known genomes. Here, we evaluated the ability of genus-specific k-mers to distinguish eight phylogroups of Escherichia coli (A, B1, C, E, D, F, G, B2) and assessed the presence of their unique 22-mers in clinical samples from microbiomes of four healthy people and four patients with Crohn's disease. We found that a phylogenetic tree inferred from the pairwise distance matrix for unique 18-mers and 22-mers of 124 genomes was fully consistent with the topology of the tree, obtained with concatenated aligned sequences of orthologous genes. Therefore, we propose strain-specific "barcodes" for rapid phylotyping. Using unique 22-mers for taxonomic analysis, we detected microbes of all groups in human microbiomes; however, their presence in the five samples was significantly different. Pointing to the intraspecies heterogeneity of E. coli in the natural microflora, this also indicates the feasibility of further studies of the role of this heterogeneity in maintaining population homeostasis.
Keywords: alignment-free algorithms; bacterial genomes; genome barcodes; human microbiome; k-mers; metagenomes; phylogenetic trees; phylotyping; taxonomic profiling.
Conflict of interest statement
The authors declare no conflict of interest.
Figures
Similar articles
-
Lacticaseibacillus paracasei: Occurrence in the Human Gut Microbiota and K-Mer-Based Assessment of Intraspecies Diversity.Life (Basel). 2021 Nov 17;11(11):1246. doi: 10.3390/life11111246. Life (Basel). 2021. PMID: 34833122 Free PMC article.
-
Selection of marker genes for genetic barcoding of microorganisms and binning of metagenomic reads by Barcoder software tools.BMC Bioinformatics. 2018 Aug 30;19(1):309. doi: 10.1186/s12859-018-2320-1. BMC Bioinformatics. 2018. PMID: 30165813 Free PMC article.
-
Highlighting new phylogenetic specificities of Crohn's disease microbiota.Inflamm Bowel Dis. 2011 Jan;17(1):185-92. doi: 10.1002/ibd.21436. Epub 2010 Aug 18. Inflamm Bowel Dis. 2011. PMID: 20722058
-
Intestinal colonization with phylogenetic group B2 Escherichia coli related to inflammatory bowel disease: a systematic review and meta-analysis.Scand J Gastroenterol. 2015;50(10):1199-207. doi: 10.3109/00365521.2015.1028993. Epub 2015 Apr 24. Scand J Gastroenterol. 2015. PMID: 25910859 Review.
-
Bacterial characteristics of importance for recurrent urinary tract infections caused by Escherichia coli.Dan Med Bull. 2011 Apr;58(4):B4187. Dan Med Bull. 2011. PMID: 21466767 Review.
Cited by
-
Scalable Microbial Strain Inference in Metagenomic Data Using StrainFacts.Front Bioinform. 2022 May 16;2:867386. doi: 10.3389/fbinf.2022.867386. eCollection 2022. Front Bioinform. 2022. PMID: 36304283 Free PMC article.
-
Lacticaseibacillus paracasei: Occurrence in the Human Gut Microbiota and K-Mer-Based Assessment of Intraspecies Diversity.Life (Basel). 2021 Nov 17;11(11):1246. doi: 10.3390/life11111246. Life (Basel). 2021. PMID: 34833122 Free PMC article.
-
Suppression of Escherichia coli Growth Dynamics via RNAs Secreted by Competing Bacteria.Front Mol Biosci. 2021 Apr 15;8:609979. doi: 10.3389/fmolb.2021.609979. eCollection 2021. Front Mol Biosci. 2021. PMID: 33937321 Free PMC article.
-
Bioinformatics Methods in Medical Genetics and Genomics.Int J Mol Sci. 2020 Aug 28;21(17):6224. doi: 10.3390/ijms21176224. Int J Mol Sci. 2020. PMID: 32872128 Free PMC article.
References
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Medical
