Performance and Accuracy of Four Open-Source Tools for In Silico Serotyping of Salmonella spp. Based on Whole-Genome Short-Read Sequencing Data
- PMID: 31862714
- PMCID: PMC7028957
- DOI: 10.1128/AEM.02265-19
Performance and Accuracy of Four Open-Source Tools for In Silico Serotyping of Salmonella spp. Based on Whole-Genome Short-Read Sequencing Data
Abstract
We compared the performance of four open-source in silico Salmonella typing tools (SeqSero, SeqSero2, Salmonella In Silico Typing Resource [SISTR], and Metric Oriented Sequence Typer [MOST]) to assess their potential for replacing laboratory serological testing with serovar predictions from whole-genome sequencing data. We conducted a retrospective analysis of 1,624 Salmonella isolates of 72 serovars submitted to the German National Salmonella Reference Laboratory between 1999 and 2019. All isolates are derived from animal and foodstuff origins. We conducted Illumina short-read sequencing and compared the in silico serovar prediction results with the results of routine laboratory serotyping. We found the best-performing in silico serovar prediction tool to be SISTR, with 94% correctly typed isolates, followed by SeqSero2 (87%), SeqSero (81%), and MOST (79%). Furthermore, we found that mapping-based tools like SeqSero and SeqSero2 (allele mode) were more reliable for the prediction of monophasic variants, while sequence type and cluster-based methods like MOST and SISTR (core-genome multilocus sequence type [cgMLST]), showed greater resilience when confronted with GC-biased sequencing data. We showed that the choice of library preparation kit could substantially affect O antigen detection, due to the low GC content of the wzx and wzy genes. Although the accuracy of computational serovar predictions is still not quite on par with traditional serotyping by Salmonella reference laboratories, the command-line tools investigated in this study perform a rapid, efficient, inexpensive, and reproducible analysis, which can be integrated into in-house characterization pipelines. Based on our results, we find SISTR most suitable for automated, routine serotyping for public health surveillance of SalmonellaIMPORTANCESalmonella spp. are important foodborne pathogens. To reduce the number of infected patients, it is essential to understand which subtypes of the bacteria cause disease outbreaks. Traditionally, characterization of Salmonella requires serological testing, a laboratory method by which Salmonella isolates can be classified into over 2,600 distinct subtypes, called serovars. Due to recent advances in whole-genome sequencing, many tools have been developed to replace traditional testing methods with computational analysis of genome sequences. It is crucial to validate that these tools, many already in use for routine surveillance, deliver accurate and reliable serovar information. In this study, we set out to compare which of the currently available open-source command-line tools is most suitable to replace serological testing. A thorough evaluation of the differing computational approaches is highly important to ensure the backward compatibility of serotyping data and to maintain comparability between laboratories.
Keywords: O antigen; Salmonella; serotyping; serovar prediction; whole-genome sequencing.
Copyright © 2020 Uelze et al.
Figures
Comment in
-
GC Content-Associated Sequencing Bias Caused by Library Preparation Method May Infrequently Affect Salmonella Serotype Prediction Using SeqSero2.Appl Environ Microbiol. 2020 Sep 1;86(18):e00614-20. doi: 10.1128/AEM.00614-20. Print 2020 Sep 1. Appl Environ Microbiol. 2020. PMID: 32680856 Free PMC article. No abstract available.
-
Reply to Li et al., "GC Content-Associated Sequencing Bias Caused by Library Preparation Method May Infrequently Affect Salmonella Serotype Prediction Using SeqSero2".Appl Environ Microbiol. 2020 Sep 1;86(18):e01260-20. doi: 10.1128/AEM.01260-20. Print 2020 Sep 1. Appl Environ Microbiol. 2020. PMID: 32680857 Free PMC article. No abstract available.
Similar articles
-
SeqSero2: Rapid and Improved Salmonella Serotype Determination Using Whole-Genome Sequencing Data.Appl Environ Microbiol. 2019 Nov 14;85(23):e01746-19. doi: 10.1128/AEM.01746-19. Print 2019 Dec 1. Appl Environ Microbiol. 2019. PMID: 31540993 Free PMC article.
-
Salmonella Serotyping Using Whole Genome Sequencing.Front Microbiol. 2018 Dec 13;9:2993. doi: 10.3389/fmicb.2018.02993. eCollection 2018. Front Microbiol. 2018. PMID: 30619114 Free PMC article.
-
Comprehensive assessment of the quality of Salmonella whole genome sequence data available in public sequence databases using the Salmonella in silico Typing Resource (SISTR).Microb Genom. 2018 Feb;4(2):e000151. doi: 10.1099/mgen.0.000151. Epub 2018 Jan 17. Microb Genom. 2018. PMID: 29338812 Free PMC article.
-
Molecular methods for serovar determination of Salmonella.Crit Rev Microbiol. 2015;41(3):309-25. doi: 10.3109/1040841X.2013.837862. Epub 2013 Nov 14. Crit Rev Microbiol. 2015. PMID: 24228625 Review.
-
A genomic overview of the population structure of Salmonella.PLoS Genet. 2018 Apr 5;14(4):e1007261. doi: 10.1371/journal.pgen.1007261. eCollection 2018 Apr. PLoS Genet. 2018. PMID: 29621240 Free PMC article. Review.
Cited by
-
Oxford nanopore technologies-a valuable tool to generate whole-genome sequencing data for in silico serotyping and the detection of genetic markers in Salmonella.Front Vet Sci. 2023 Jun 1;10:1178922. doi: 10.3389/fvets.2023.1178922. eCollection 2023. Front Vet Sci. 2023. PMID: 37323838 Free PMC article.
-
Genomic Surveillance of Salmonella from the Comunitat Valenciana (Spain).Antibiotics (Basel). 2023 May 9;12(5):883. doi: 10.3390/antibiotics12050883. Antibiotics (Basel). 2023. PMID: 37237786 Free PMC article.
-
Are Enterobacteriaceae and Enterococcus Isolated from Powdered Infant Formula a Hazard for Infants? A Genomic Analysis.Foods. 2022 Nov 8;11(22):3556. doi: 10.3390/foods11223556. Foods. 2022. PMID: 36429148 Free PMC article.
-
Phenotypic and genotypic characterization of antimicrobial resistance profiles in Salmonella isolated from waterfowl in 2002-2005 and 2018-2020 in Sichuan, China.Front Microbiol. 2022 Oct 6;13:987613. doi: 10.3389/fmicb.2022.987613. eCollection 2022. Front Microbiol. 2022. PMID: 36274743 Free PMC article.
-
Combination of Whole Genome Sequencing and Metagenomics for Microbiological Diagnostics.Int J Mol Sci. 2022 Aug 30;23(17):9834. doi: 10.3390/ijms23179834. Int J Mol Sci. 2022. PMID: 36077231 Free PMC article. Review.
References
-
- Yoshida CE, Kruczkiewicz P, Laing CR, Lingohr EJ, Gannon VPJ, Nash JHE, Taboada EN. 2016. The Salmonella In Silico Typing Resource (SISTR): an open web-accessible tool for rapidly typing and subtyping draft Salmonella genome assemblies. PLoS One 11:e0147101. doi:10.1371/journal.pone.0147101. - DOI - PMC - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Miscellaneous
