Evaluating the accuracy of Listeria monocytogenes assemblies from quasimetagenomic samples using long and short reads
- PMID: 34039264
- PMCID: PMC8157722
- DOI: 10.1186/s12864-021-07702-2
Evaluating the accuracy of Listeria monocytogenes assemblies from quasimetagenomic samples using long and short reads
Abstract
Background: Whole genome sequencing of cultured pathogens is the state of the art public health response for the bioinformatic source tracking of illness outbreaks. Quasimetagenomics can substantially reduce the amount of culturing needed before a high quality genome can be recovered. Highly accurate short read data is analyzed for single nucleotide polymorphisms and multi-locus sequence types to differentiate strains but cannot span many genomic repeats, resulting in highly fragmented assemblies. Long reads can span repeats, resulting in much more contiguous assemblies, but have lower accuracy than short reads.
Results: We evaluated the accuracy of Listeria monocytogenes assemblies from enrichments (quasimetagenomes) of naturally-contaminated ice cream using long read (Oxford Nanopore) and short read (Illumina) sequencing data. Accuracy of ten assembly approaches, over a range of sequencing depths, was evaluated by comparing sequence similarity of genes in assemblies to a complete reference genome. Long read assemblies reconstructed a circularized genome as well as a 71 kbp plasmid after 24 h of enrichment; however, high error rates prevented high fidelity gene assembly, even at 150X depth of coverage. Short read assemblies accurately reconstructed the core genes after 28 h of enrichment but produced highly fragmented genomes. Hybrid approaches demonstrated promising results but had biases based upon the initial assembly strategy. Short read assemblies scaffolded with long reads accurately assembled the core genes after just 24 h of enrichment, but were highly fragmented. Long read assemblies polished with short reads reconstructed a circularized genome and plasmid and assembled all the genes after 24 h enrichment but with less fidelity for the core genes than the short read assemblies.
Conclusion: The integration of long and short read sequencing of quasimetagenomes expedited the reconstruction of a high quality pathogen genome compared to either platform alone. A new and more complete level of information about genome structure, gene order and mobile elements can be added to the public health response by incorporating long read analyses with the standard short read WGS outbreak response.
Keywords: Assembly; Listeria; Metagenomics; Nanopore; Quasimetagenomics; Source tracking.
Conflict of interest statement
The authors declare that they have no competing interests.
Figures
Similar articles
-
Polishing the Oxford Nanopore long-read assemblies of bacterial pathogens with Illumina short reads to improve genomic analyses.Genomics. 2021 May;113(3):1366-1377. doi: 10.1016/j.ygeno.2021.03.018. Epub 2021 Mar 11. Genomics. 2021. PMID: 33716184
-
Quasimetagenomic source tracking of Listeria monocytogenes from naturally contaminated ice cream.BMC Infect Dis. 2020 Jan 29;20(1):83. doi: 10.1186/s12879-019-4747-z. BMC Infect Dis. 2020. PMID: 31996135 Free PMC article.
-
Evaluation of strategies for the assembly of diverse bacterial genomes using MinION long-read sequencing.BMC Genomics. 2019 Jan 9;20(1):23. doi: 10.1186/s12864-018-5381-7. BMC Genomics. 2019. PMID: 30626323 Free PMC article.
-
Chromosome-level hybrid de novo genome assemblies as an attainable option for nonmodel insects.Mol Ecol Resour. 2020 Sep;20(5):1277-1293. doi: 10.1111/1755-0998.13176. Epub 2020 Jun 7. Mol Ecol Resour. 2020. PMID: 32329220 Review.
-
Complex genome assembly based on long-read sequencing.Brief Bioinform. 2022 Sep 20;23(5):bbac305. doi: 10.1093/bib/bbac305. Brief Bioinform. 2022. PMID: 35940845 Review.
Cited by
-
Harmonization of supervised machine learning practices for efficient source attribution of Listeria monocytogenes based on genomic data.BMC Genomics. 2023 Sep 22;24(1):560. doi: 10.1186/s12864-023-09667-w. BMC Genomics. 2023. PMID: 37736708 Free PMC article.
-
Precision metagenomics sequencing for food safety: hybrid assembly of Shiga toxin-producing Escherichia coli in enriched agricultural water.Front Microbiol. 2023 Aug 31;14:1221668. doi: 10.3389/fmicb.2023.1221668. eCollection 2023. Front Microbiol. 2023. PMID: 37720160 Free PMC article.
-
The composition of environmental microbiota in three tree fruit packing facilities changed over seasons and contained taxa indicative of L. monocytogenes contamination.Microbiome. 2023 Jun 5;11(1):128. doi: 10.1186/s40168-023-01544-8. Microbiome. 2023. PMID: 37271802 Free PMC article.
-
Application of MinION sequencing as a tool for the rapid detection and characterization of Listeria monocytogenes in smoked salmon.Front Microbiol. 2022 Aug 10;13:931810. doi: 10.3389/fmicb.2022.931810. eCollection 2022. Front Microbiol. 2022. PMID: 36033887 Free PMC article.
-
The Saprophytic Lifestyle of Listeria monocytogenes and Entry Into the Food-Processing Environment.Front Microbiol. 2022 Mar 8;13:789801. doi: 10.3389/fmicb.2022.789801. eCollection 2022. Front Microbiol. 2022. PMID: 35350628 Free PMC article. Review.
References
-
- Centers for Disease Control and Prevention (CDC) Establishment of a national surveillance program for antimicrobial resistance in Salmonella. MMWR Morb Mortal Wkly Rep. 1996;45:110–111. - PubMed
-
- Tollefson L. FDA reveals plans for antimicrobial susceptibility monitoring. J Am Vet Med Assoc. 1996;208(4):459–460. - PubMed
-
- Davis S, Pettengill JB, Luo Y, Payne J, Shpuntoff A, Rand H, et al. CFSAN SNP Pipeline: an automated method for constructing SNP matrices from next-generation sequence data. PeerJ Comput Sci. 2015:e20. 10.7717/peerj-cs.20.
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
