2020
DOI: 10.1093/nar/gkaa566
|View full text |Cite
|
Sign up to set email alerts
|

STing: accurate and ultrafast genomic profiling with exact sequence matches

Abstract: Abstract Genome-enabled approaches to molecular epidemiology have become essential to public health agencies and the microbial research community. We developed the algorithm STing to provide turn-key solutions for molecular typing and gene detection directly from next generation sequence data of microbial pathogens. Our implementation of STing uses an innovative k-mer search strategy that eliminates the computational overhead associated with the time-consuming st… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
11
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
3
1
1

Relationship

1
4

Authors

Journals

citations
Cited by 6 publications
(11 citation statements)
references
References 20 publications
0
11
0
Order By: Relevance
“…The comparison between different cgMLST workflows based on accuracy is hampered by the absence of a common strategy for definition of alleles, due to cgMLST approaches (i.e. assembly-based [ 12 , 14 – 17 ] or -free [ 13 , 18 , 19 ], or combination of both [ 20 ]), as well as implemented algorithmic steps and related parameters (e.g. BLAST-based or -free algorithms, BLASTN or BLASTP, detection of open reading frames (ORFs) before BLAST step, coverage and identity of aligned sequences [ 12 20 ]).…”
Section: Discussionmentioning
confidence: 99%
See 3 more Smart Citations
“…The comparison between different cgMLST workflows based on accuracy is hampered by the absence of a common strategy for definition of alleles, due to cgMLST approaches (i.e. assembly-based [ 12 , 14 – 17 ] or -free [ 13 , 18 , 19 ], or combination of both [ 20 ]), as well as implemented algorithmic steps and related parameters (e.g. BLAST-based or -free algorithms, BLASTN or BLASTP, detection of open reading frames (ORFs) before BLAST step, coverage and identity of aligned sequences [ 12 20 ]).…”
Section: Discussionmentioning
confidence: 99%
“…These findings are consistent with previous studies on viruses [ 49 ] and bacteria [ 50 ], that did not observe improvement of the breadth of coverage above specific values of depth of coverage. Few studies recommended minimal depth of coverage for precise cgMLST typing of L. monocytogenes (40X with BIGSdb) [ 24 , 28 , 36 ], Yersinia (50X with BIGSdb) [ 42 ], Mycoplasma (47X with SeqSphere) [ 38 ], Campylobacter , Chlamydia , Neisseria and Streptococcus (20X with STing) [ 13 ]. For the first time in the present study, we recommend 40X as a suitable read depth of coverage for the highest cgMLST precision across 6 different assembly-based and -free workflows.…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…Hence, we proceeded to investigate the association of shorter sequences (k-mers) with PFGE banding patterns. k-mers have been shown to be sufficiently informative short sequences used across a number of different applications including genome assembly (Compeau et al, 2011), read-togenome mapping (Li and Durbin, 2009), taxonomic classification (Nayfach et al, 2016;Wood et al, 2019), and multi-locus sequence typing (Gupta et al, 2017;Espitia-Navarro et al, 2020). Consequently, we represent our genomes as high-dimensional k-mer presence/absence vectors and utilize these vectors within a machine learning framework.…”
Section: Leveraging Wgs For Creating High-dimensional Modelsmentioning
confidence: 99%