2022
DOI: 10.1101/2022.10.25.513734
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

GoldRush: Ade novolong read genome assembler with linear time complexity

Abstract: Motivation: Current state-of-the-art long readde novogenome assemblers follow the Overlap Layout Consensus (OLC) paradigm, an O(n2) algorithm in its naïve implementation. While the most time- and memory-intensive step of OLC —the all-vs-all sequencing read alignment process— was improved and reimplemented in modern long read assemblers, these tools still often require excessive computational memory when assembling a typical 50X human genome dataset. Results: Here we present GoldRush, a de novo genome assembly … Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
5
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
2

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(5 citation statements)
references
References 38 publications
0
5
0
Order By: Relevance
“…The GoldRush, Flye, Redbean, and Shasta genome assemblies generated in this study have been deposited in Zenodo at https://doi.org/10. 5281/zenodo.7884681 53 . The GoldRush genome assemblies generated for the parameter sweep experiments in Supplementary Figs.…”
Section: Reporting Summarymentioning
confidence: 99%
See 1 more Smart Citation
“…The GoldRush, Flye, Redbean, and Shasta genome assemblies generated in this study have been deposited in Zenodo at https://doi.org/10. 5281/zenodo.7884681 53 . The GoldRush genome assemblies generated for the parameter sweep experiments in Supplementary Figs.…”
Section: Reporting Summarymentioning
confidence: 99%
“…GoldRush (v1.0.0) has been deposited in Zenodo at https://doi.org/10. 5281/zenodo.7884291 54 . GoldRush is available at https://github.com/ bcgsc/goldrush and released under the GPL-3 license.…”
Section: Reporting Summarymentioning
confidence: 99%
“…Following this gap-filling step, the scaffolds are output in FASTA format. Because the gaps are filled with raw long-read sequence, we recommend polishing the output assembly using long-read polishing tools such as ntEdit+Sealer (Li et al, 2022;Wong et al, 2022), Racon (Vaser et al, 2017) For more information about installing these dependencies, see Support Protocol and Basic Protocol 1, steps 1-2. Instructions for creating a conda environment that can be used for installing protocol-specific dependencies, as described below, are available in Support Protocol.…”
Section: Basic Protocol 2 Ntlink Scaffolding With Gap-fillingmentioning
confidence: 99%
“…ntLink is a lightweight, minimizer‐based long‐read scaffolding tool that was previously published as a central step in the correction and scaffolding pipeline LongStitch (Coombe et al., 2021). More recently, ntLink was integrated as a key step in our de novo long‐read assembler GoldRush (Wong et al., 2022). ntLink uses long‐read evidence to further contiguate draft assemblies from any sequencing technology.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation