2022
DOI: 10.1002/cpz1.442
|View full text |Cite
|
Sign up to set email alerts
|

ntEdit+Sealer: Efficient Targeted Error Resolution and Automated Finishing of Long‐Read Genome Assemblies

Abstract: High-quality genome assemblies are crucial to many biological studies, and utilizing long sequencing reads can help achieve higher assembly contiguity. While long reads can resolve complex and repetitive regions of a genome, their relatively high associated error rates are still a major limitation. Long reads generally produce draft genome assemblies with lower base quality, which must be corrected with a genome polishing step. Hybrid genome polishing solutions can greatly improve the quality of long-read geno… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
4
1

Relationship

4
1

Authors

Journals

citations
Cited by 6 publications
(8 citation statements)
references
References 24 publications
0
8
0
Order By: Relevance
“…After the golden path is built, the goldtigs are polished to correct mismatches and indels. The polishing protocol, GR-Edit, closely follows the ntEdit+Sealer protocol (J. X. Li et al, 2022), which has been shown to perform well using short sequencing reads.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…After the golden path is built, the goldtigs are polished to correct mismatches and indels. The polishing protocol, GR-Edit, closely follows the ntEdit+Sealer protocol (J. X. Li et al, 2022), which has been shown to perform well using short sequencing reads.…”
Section: Methodsmentioning
confidence: 99%
“…The polishing protocol, GR-Edit, closely follows the ntEdit+Sealer protocol (J. X. Li et al, 2022; Paulino et al, 2015; Warren et al, 2019), which has been shown to perform well using short sequencing reads. Unlike the currently published paradigm, which stores all the short read k-mers in a single Bloom filter used for correction, GR-Edit uses a targeted approach where each goldtig has a dedicated Bloom filter, each containing a hash representation of k-mers derived from long read subsets.…”
Section: Methodsmentioning
confidence: 99%
“…Following this gap-filling step, the scaffolds are output in FASTA format. Because the gaps are filled with raw long-read sequence, we recommend polishing the output assembly using long-read polishing tools such as ntEdit+Sealer (Li et al, 2022;Wong et al, 2022), Racon (Vaser et al, 2017) For more information about installing these dependencies, see Support Protocol and Basic Protocol 1, steps 1-2. Instructions for creating a conda environment that can be used for installing protocol-specific dependencies, as described below, are available in Support Protocol.…”
Section: Basic Protocol 2 Ntlink Scaffolding With Gap-fillingmentioning
confidence: 99%
“…If soft_mask=True is specified in the ntLink command when gap‐filling is enabled, the gaps will be filled with lowercase bases instead of uppercase bases. This soft masking could be useful for downstream analyses such as targeted polishing, for example (Li et al, 2022).…”
Section: Commentarymentioning
confidence: 99%
See 1 more Smart Citation