2019
DOI: 10.1101/653774
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Genomic diversity affects the accuracy of bacterial SNP calling pipelines

Abstract: BackgroundAccurately identifying SNPs from bacterial sequencing data is an essential requirement for using genomics to track transmission and predict important phenotypes such as antimicrobial resistance. However, most previous performance evaluations of SNP calling have been restricted to eukaryotic (human) data. Additionally, bacterial SNP calling requires choosing an appropriate reference genome to align reads to, which, together with the bioinformatic pipeline, affects the accuracy and completeness of a se… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
11
0

Year Published

2019
2019
2021
2021

Publication Types

Select...
4
3

Relationship

2
5

Authors

Journals

citations
Cited by 11 publications
(11 citation statements)
references
References 90 publications
(121 reference statements)
0
11
0
Order By: Relevance
“…genus-level phylogeny) or genera within a single tree, nor have any been tested on microbial eukaryotic genomes. In addition, most of these tools require users to select a reference genome, which can have dramatic impact on the alignments and resulting SNP calls 26 , and are unable to distinguish or map SNPs to their functional annotation, and hence cannot perform molecular evolution analysis.…”
mentioning
confidence: 99%
“…genus-level phylogeny) or genera within a single tree, nor have any been tested on microbial eukaryotic genomes. In addition, most of these tools require users to select a reference genome, which can have dramatic impact on the alignments and resulting SNP calls 26 , and are unable to distinguish or map SNPs to their functional annotation, and hence cannot perform molecular evolution analysis.…”
mentioning
confidence: 99%
“…coli and Klebsiella spp. [37], together with NCBI canonical reference sequences for several pathogen species expected to be absent from sink drain microbiota (; left). This demonstrated similar sink, ward and temporal structure to that of culture, particularly underlying similarities in the microbiota of nearby sinks, as well as flux between sampling timepoints.…”
Section: Resultsmentioning
confidence: 99%
“…There is a need for these standards to be implemented to help harmonize the analysis and the consistency in the interpretation of WGS sequence data [29], [30]. It is also critical due to the various potential sources for error in sequencing and variant-calling processes [31], [32]. To optimise the quality of the data used to generate variant calls, many of the recommendations and best practises used are shared between human and microbial genomic NGS assays [33], [34].…”
Section: Comparison Of the Fastq Data Set Submitted By Laboratories mentioning
confidence: 99%