2021
DOI: 10.1186/s12859-021-04174-9
|View full text |Cite
|
Sign up to set email alerts
|

SAUTE: sequence assembly using target enrichment

Abstract: Background Illumina is the dominant sequencing technology at this time. Short length, short insert size, some systematic biases, and low-level carryover contamination in Illumina reads continue to make assembly of repeated regions a challenging problem. Some applications also require finding multiple well supported variants for assembled regions. Results To facilitate assembly of repeat regions and to report multiple well supported variants when a … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
3

Relationship

1
7

Authors

Journals

citations
Cited by 13 publications
(10 citation statements)
references
References 48 publications
0
7
0
Order By: Relevance
“…All fluoroquinolones requiring SNP confirmation were verified by identifying the SNPs conferring resistance in Tablet [44] using the AMR++ output. For annotation by AMR FinderPlus, fastq files were aligned against the AMRFinder Plus database using SAUTE [45] on the CFSAN HPC. https://github.com/ncbi/amr/wiki/Methods.…”
Section: Bioinformatic Analysesmentioning
confidence: 99%
“…All fluoroquinolones requiring SNP confirmation were verified by identifying the SNPs conferring resistance in Tablet [44] using the AMR++ output. For annotation by AMR FinderPlus, fastq files were aligned against the AMRFinder Plus database using SAUTE [45] on the CFSAN HPC. https://github.com/ncbi/amr/wiki/Methods.…”
Section: Bioinformatic Analysesmentioning
confidence: 99%
“…Traditional phylogenetic marker loci were often selected because they are amenable to reliable Sanger sequencing due in part to their multi-copy nature (rDNA) or presence on the highly abundant mitochondrial genome (COI and cytB). In contrast, these attributes are detrimental to de novo assembly as within-genome copy variation often confounds assembly methods ( Souvorov and Agarwala, 2021 ) resulting in fragmented or missing loci. Despite this limitation we were able to construct an HSP90 phylogeny from our sequence data using targeted assembly of the single-copy nuclear gene thereby allowing comparison to previously published work.…”
Section: Discussionmentioning
confidence: 99%
“…A single-gene phylogeny was constructed for Heat Shock Protein 90 (HSP90) to facilitate comparison to other Globodera sequences available in Genbank. A single 1402 bp HSP90 sequence from G. ellingtonae (MK105568.1) was used as a reference to anchor the assembly of the locus from each sample using the software package SAUTE ( Souvorov and Agarwala, 2021 ). Sequences from identified Globodera and Cactodera (outgroup) from PopSet 1780190675 ( Skantar et al, 2021 ) were used for comparison, as were the partial HSP90 sequences obtained by Lax et al (2014) from a G. ellingtonae population collected in Argentina.…”
Section: Methodsmentioning
confidence: 99%
“…(v0.6.2; (accessed on 27 January 2022)) was used to filter the paired-end reads (Q > 30 and length > 50 bp) [ 23 ]. De novo assembly was performed using SKESA v2.3.0 ( (accessed on 27 January 2022)) and the assembled contigs were polished using Shovill (v1.1.0; (accessed on 27 January 2022)) [ 24 , 25 ]. Assembly metrics were calculated using QUAST (v5.0.2; (accessed on 27 January 2022)) [ 26 ].…”
Section: Methodsmentioning
confidence: 99%