2018
DOI: 10.1099/mgen.0.000151
|View full text |Cite
|
Sign up to set email alerts
|

Comprehensive assessment of the quality of Salmonella whole genome sequence data available in public sequence databases using the Salmonella in silico Typing Resource (SISTR)

Abstract: Public health and food safety institutions around the world are adopting whole genome sequencing (WGS) to replace conventional methods for characterizing Salmonella for use in surveillance and outbreak response. Falling costs and increased throughput of WGS have resulted in an explosion of data, but questions remain as to the reliability and robustness of the data. Due to the critical importance of serovar information to public health, it is essential to have reliable serovar assignments available for all of t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
70
0
1

Year Published

2018
2018
2023
2023

Publication Types

Select...
6
2

Relationship

1
7

Authors

Journals

citations
Cited by 49 publications
(72 citation statements)
references
References 27 publications
1
70
0
1
Order By: Relevance
“…Notably, intraspecies contamination was more prevalent than cross-species contamination. A recent assessment of 67758 publically-available Salmonella sequences determined that 1.87% of samples had cross-species contamination based on a read classification approach (Robertson et al, 2018). Prevalence of cross-species sequence contamination in public repositories is a known issue that has been described in a number of studies (Merchant et al, 2014;Mukherjee et al, 2015;Lee et al, 2017;Cornet et al, 2018).…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…Notably, intraspecies contamination was more prevalent than cross-species contamination. A recent assessment of 67758 publically-available Salmonella sequences determined that 1.87% of samples had cross-species contamination based on a read classification approach (Robertson et al, 2018). Prevalence of cross-species sequence contamination in public repositories is a known issue that has been described in a number of studies (Merchant et al, 2014;Mukherjee et al, 2015;Lee et al, 2017;Cornet et al, 2018).…”
Section: Discussionmentioning
confidence: 99%
“…The presence of contamination in WGS data is recognized as an important sequence quality issue (Merchant et al, 2014;Ballenghien et al, 2017;Robertson et al, 2018;Cornet et al, 2018). Introduction of contaminants can occur at many stages in the generation of bacterial sequence data.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…В тех случаях, когда установить антигенную формулу невозможно из-за неполноты данных WGS, платформа SISTR выбирает наиболее вероятный серотип внутри кластера cgMLST. В настоящее время SISTR внедрена в систему здравоохранения Канады как замена классическому серотипированию [40,52].…”
Section: международные базы данных Wgs используемые для субтипированunclassified
“…IRIDA currently performs Salmonella serotype prediction with the Salmonella In Silico Typing Resource (SISTR), a validated bioinformatics platform for rapid in silico inference from draft Salmonella genome assemblies (34). SISTR performs highly-accurate serovar prediction based on genoserotyping through sequence analysis of the Salmonella O and H antigens, with additional refinement of predictions based on population structure context via cgMLST analysis and genomic distance calculation using MASH (9,34,35). Results generated by SISTR are then incorporated into the Sample metadata and can be conveniently viewed in a single table.…”
Section: Serotype Predictionmentioning
confidence: 99%