2020
DOI: 10.1101/2020.01.27.921817
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

HASLR: Fast Hybrid Assembly of Long Reads

Abstract: Third generation sequencing technologies from platforms such as Oxford Nanopore Technologies and Pacific Biosciences have paved the way for building more contiguous assemblies and complete reconstruction of genomes. The larger effective length of the reads generated with these technologies has provided a mean to overcome the challenges of short to mid-range repeats. Currently, accurate long read assemblers are computationally expensive while faster methods are not as accurate. Therefore, there is still an unme… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
12
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 13 publications
(12 citation statements)
references
References 40 publications
(54 reference statements)
0
12
0
Order By: Relevance
“…First, the long‐read assembly pipeline wtdbg2 (v2.5; Ruan & Li, 2020) was used to generate an initial set of contigs using the parameter –X 25, ‐x preset1, ‐g 1.3G, ‐L 8k, ‐p 21, ‐‐edge‐min 2, and ‐‐rescue‐low‐cov‐edges set. In a second assembly, the short reads were combined with a 25× subset of the longest nanopore reads using the hybrid assembler HASLR (v.0.8a1; Haghshenas et al, 2020) under default settings. Both assemblies were then merged into a single draft using Quickmerge (v.0.3; Chakraborty et al, 2016).…”
Section: Methodsmentioning
confidence: 99%
“…First, the long‐read assembly pipeline wtdbg2 (v2.5; Ruan & Li, 2020) was used to generate an initial set of contigs using the parameter –X 25, ‐x preset1, ‐g 1.3G, ‐L 8k, ‐p 21, ‐‐edge‐min 2, and ‐‐rescue‐low‐cov‐edges set. In a second assembly, the short reads were combined with a 25× subset of the longest nanopore reads using the hybrid assembler HASLR (v.0.8a1; Haghshenas et al, 2020) under default settings. Both assemblies were then merged into a single draft using Quickmerge (v.0.3; Chakraborty et al, 2016).…”
Section: Methodsmentioning
confidence: 99%
“…New software and technology versions are frequently released, thus making comparisons obsolete within months. There are efforts to optimise assemblers towards speed and reduced memory usage (Gatter et al, 2021;Haghshenas et al, 2020;Shafin et al, 2020). While this is important to complete extremely large plant genome assemblies and to reduce the environmental impact of bioinformatics, quality improvements are still of interest and would be beneficial for smaller genomes.…”
Section: Computational Future Of Plant Genomicsmentioning
confidence: 99%
“…Canu, MECAT, MECAT2, NECAT and NextDenovo correct low-accuracy long reads prior to assembly; Flye, Raven and NextDenovo have a polishing step; and many assemblers include a scaffolding step to yield both contigs and scaffolds. Some assemblers propose a hybrid assembly strategy, using both short and long reads, such as HALSR [246], MaSuRCA [247] and WENGAN [248].…”
Section: Long Readsmentioning
confidence: 99%