BackgroundRapid and accurate identification of Verotoxigenic Escherichia coli (VTEC) O157:H7 is dependent on well-established, standardized and highly discriminatory typing methods. Currently, conventional subtyping tests for foodborne bacterial pathogen surveillance are rapidly being replaced with whole-genome sequencing (WGS) in public health laboratories. The capacity of WGS to revolutionize global foodborne disease surveillance has positioned this tool to become the new gold standard; however, to ensure evidence standards for public health decision making can still be achieved, the performance of WGS must be thoroughly validated against current gold standard methods prior to implementation. Here we aim to verify the performance of WGS in comparison to pulsed-field gel electrophoresis (PFGE) and multiple-locus variable-number tandem repeat analysis (MLVA) for eight retrospective outbreaks of VTEC O157:H7 from the Canadian perspective. Since real-time implementation and routine use of WGS in public health laboratories is highly reliant on standardized data analysis tools, we also provide a comparative analysis of two popular methodologies for WGS analyses; an in-house developed single nucleotide variant phylogenomics (SNVPhyl) pipeline and the BioNumerics whole genome multilocus sequence typing (wgMLST) tool. To provide a useful and consistent starting point for examining laboratory-based surveillance data for VTEC O157:H7 in Canada, we also aim to describe the number of genetic differences observed among outbreak-associated isolates.ResultsWGS provided enhanced resolution over traditional subtyping methods, and accurately distinguished outbreak-related isolates from non-outbreak related isolates with high epidemiological concordance. WGS also illuminated potential linkages between sporadic cases of illness and contaminated food, and isolates spanning multiple years. The topologies generated by SNVPhyl and wgMLST were highly congruent with strong statistical support. Few genetic differences were observed among outbreak-related isolates (≤5 SNVs/ < 10 wgMLST alleles) unless the outbreak was suspected to be multi-strain.ConclusionsThis study validates the superiority of WGS and indicates the BioNumerics wgMLST schema is suitable for surveillance and cluster detection of VTEC O157:H7. These findings will provide a useful and consistent starting point for examining WGS data for prospective laboratory-based surveillance of VTEC O157:H7, but however, the data will continue to be interpreted according to context and in combination with epidemiological and food safety evidence to inform public-health decision making in Canada.Electronic supplementary materialThe online version of this article (10.1186/s12864-018-5243-3) contains supplementary material, which is available to authorized users.
The lack of pattern diversity among pulsed-field gel electrophoresis (PFGE) profiles for Escherichia coli O157:H7 in Canada does not consistently provide optimal discrimination, and therefore, differentiating temporally and/or geographically associated sporadic cases from potential outbreak cases can at times impede investigations. To address this limitation, DNA sequence-based methods such as multilocus variable-number tandem-repeat analysis (MLVA) have been explored. To assess the performance of MLVA as a supplemental method to PFGE from the Canadian perspective, a retrospective analysis of all E. coli O157:H7 isolated in Canada from January 2008 to December 2012 (inclusive) was conducted. A total of 2285 E. coli O157:H7 isolates and 63 clusters of cases (by PFGE) were selected for the study. Based on the qualitative analysis, the addition of MLVA improved the categorization of cases for 60% of clusters and no change was observed for ∼40% of clusters investigated. In such situations, MLVA serves to confirm PFGE results, but may not add further information per se. The findings of this study demonstrate that MLVA data, when used in combination with PFGE-based analyses, provide additional resolution to the detection of clusters lacking PFGE diversity as well as demonstrate good epidemiological concordance. In addition, MLVA is able to identify cluster-associated isolates with variant PFGE pattern combinations that may have been previously missed by PFGE alone. Optimal laboratory surveillance in Canada is achieved with the application of PFGE and MLVA in tandem for routine surveillance, cluster detection, and outbreak response.
Multilocus variable-number tandem repeat analysis (MLVA) is a rapid and reproducible typing method that is an important tool for investigation, as well as detection, of national and multinational outbreaks of a range of food-borne pathogens. Salmonella enterica serovar Enteritidis is the most common Salmonella serovar associated with human salmonellosis in the European Union/European Economic Area and North America. Fourteen laboratories from 13 countries in Europe and North America participated in a validation study for MLVA of S. Enteritidis targeting five loci. Following normalisation of fragment sizes using a set of reference strains, a blinded set of 24 strains with known allele sizes was analysed by each participant. The S. Enteritidis 5-loci MLVA protocol was shown to produce internationally comparable results as more than 90% of the participants reported less than 5% discrepant MLVA profiles. All 14 participating laboratories performed well, even those where experience with this typing method was limited. The raw fragment length data were consistent throughout, and the inter-laboratory validation helped to standardise the conversion of raw data to repeat numbers with at least two countries updating their internal procedures. However, differences in assigned MLVA profiles remain between well-established protocols and should be taken into account when exchanging data.
Background The reliability of culture-independent pathogen detection in foods using metagenomics is contingent on the quality and composition of the reference database. The inclusion of microbial sequences from a diverse representation of taxonomies in universal reference databases is recommended to maximize classification precision for pathogen detection. However, these sizable databases have high memory requirements that may be out of reach for some users. In this study, we aimed to assess the performance of a foodborne pathogen (FBP)-specific reference database (taxon-specific) relative to a universal reference database (taxon-agnostic). We tested our FBP-specific reference database's performance for detecting Listeria monocytogenes in two complex food matrices—ready-to-eat (RTE) turkey deli meat and prepackaged spinach—using three popular read-based DNA-to-DNA metagenomic classifiers: Centrifuge, Kraken 2 and KrakenUniq. Results In silico host sequence removal led to substantially fewer false positive (FP) classifications and higher classification precision in RTE turkey deli meat datasets using the FBP-specific reference database. No considerable improvement in classification precision was observed following host filtering for prepackaged spinach datasets and was likely a consequence of a higher microbe-to-host sequence ratio. All datasets classified with Centrifuge using the FBP-specific reference database had the lowest classification precision compared to Kraken 2 or KrakenUniq. When a confidence-scoring threshold was applied, a nearly equivalent precision to the universal reference database was achieved for Kraken 2 and KrakenUniq. Recall was high for both reference databases across all datasets and classifiers. Substantially fewer computational resources were required for metagenomics-based detection of L. monocytogenes using the FBP-specific reference database, especially when combined with Kraken 2. Conclusions A universal (taxon-agnostic) reference database is not essential for accurate and reliable metagenomics-based pathogen detection of L. monocytogenes in complex food matrices. Equivalent classification performance can be achieved using a taxon-specific reference database when the appropriate quality control measures, classification software, and analysis parameters are applied. This approach is less computationally demanding and more attainable for the broader scientific and food safety communities.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.