BHap: a novel approach for bacterial haplotype reconstruction
- PMID: 31004480
- PMCID: PMC6931272
- DOI: 10.1093/bioinformatics/btz280
BHap: a novel approach for bacterial haplotype reconstruction
Abstract
Motivation: The bacterial haplotype reconstruction is critical for selecting proper treatments for diseases caused by unknown haplotypes. Existing methods and tools do not work well on this task, because they are usually developed for viral instead of bacterial populations.
Results: In this study, we developed BHap, a novel algorithm based on fuzzy flow networks, for reconstructing bacterial haplotypes from next generation sequencing data. Tested on simulated and experimental datasets, we showed that BHap was capable of reconstructing haplotypes of bacterial populations with an average F1 score of 0.87, an average precision of 0.87 and an average recall of 0.88. We also demonstrated that BHap had a low susceptibility to sequencing errors, was capable of reconstructing haplotypes with low coverage and could handle a wide range of mutation rates. Compared with existing approaches, BHap outperformed them in terms of higher F1 scores, better precision, better recall and more accurate estimation of the number of haplotypes.
Availability and implementation: The BHap tool is available at http://www.cs.ucf.edu/∼xiaoman/BHap/.
Supplementary information: Supplementary data are available at Bioinformatics online.
© The Author(s) 2019. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Figures
Similar articles
-
Evaluation of haplotype callers for next-generation sequencing of viruses.Infect Genet Evol. 2020 Aug;82:104277. doi: 10.1016/j.meegid.2020.104277. Epub 2020 Mar 6. Infect Genet Evol. 2020. PMID: 32151775 Free PMC article.
-
Reconstructing viral haplotypes using long reads.Bioinformatics. 2022 Apr 12;38(8):2127-2134. doi: 10.1093/bioinformatics/btac089. Bioinformatics. 2022. PMID: 35157018
-
De novo haplotype reconstruction in viral quasispecies using paired-end read guided path finding.Bioinformatics. 2018 Sep 1;34(17):2927-2935. doi: 10.1093/bioinformatics/bty202. Bioinformatics. 2018. PMID: 29617936
-
mixtureS: a novel tool for bacterial strain genome reconstruction from reads.Bioinformatics. 2021 May 1;37(4):575-577. doi: 10.1093/bioinformatics/btaa728. Bioinformatics. 2021. PMID: 32805048 Free PMC article.
-
Benchmarking of viral haplotype reconstruction programmes: an overview of the capacities and limitations of currently available programmes.Brief Bioinform. 2014 May;15(3):431-42. doi: 10.1093/bib/bbs081. Epub 2012 Dec 19. Brief Bioinform. 2014. PMID: 23257116 Review.
Cited by
-
Are the predicted known bacterial strains in a sample really present? A case study.PLoS One. 2023 Oct 13;18(10):e0291964. doi: 10.1371/journal.pone.0291964. eCollection 2023. PLoS One. 2023. PMID: 37831725 Free PMC article.
-
cgMSI: pathogen detection within species from nanopore metagenomic sequencing data.BMC Bioinformatics. 2023 Oct 12;24(1):387. doi: 10.1186/s12859-023-05512-9. BMC Bioinformatics. 2023. PMID: 37821827 Free PMC article.
-
Statistical and Computational Methods for Microbial Strain Analysis.Methods Mol Biol. 2023;2629:231-245. doi: 10.1007/978-1-0716-2986-4_11. Methods Mol Biol. 2023. PMID: 36929080 Review.
-
Pitfalls of genotyping microbial communities with rapidly growing genome collections.Cell Syst. 2023 Feb 15;14(2):160-176.e3. doi: 10.1016/j.cels.2022.12.007. Epub 2023 Jan 18. Cell Syst. 2023. PMID: 36657438 Free PMC article.
-
Scalable Microbial Strain Inference in Metagenomic Data Using StrainFacts.Front Bioinform. 2022 May 16;2:867386. doi: 10.3389/fbinf.2022.867386. eCollection 2022. Front Bioinform. 2022. PMID: 36304283 Free PMC article.
References
-
- Glenn T.C. (2011) Field guide to next‐generation DNA sequencers. Mol. Ecol. Resour., 11, 759–769. - PubMed
