2018
DOI: 10.1101/464420
Preprint

BELLA: Berkeley Efficient Long-Read to Long-Read Aligner and Overlapper

Abstract: Recent advances in long-read sequencing enable the characterization of genome structure and its intra- and inter-species variation at a resolution that was previously impossible. Detecting overlaps between reads is integral to many long-read genomics pipelines, such as de novo genome assembly. While longer reads simplify genome assembly and improve the contiguity of the reconstruction, current long-read technologies come with high error rates. We present Berkeley Long-Read to Long-Read Aligner and Overlapper (B…

Cited by 12 publications (35 citation statements)
References 35 publications

“…For these reasons, we chose BELLA as the basis for our distributed memory algorithm. The quality produced by diBELLA is at least that of BELLA (see [13] for quality comparisons over data sets also used in this study), and higher when using less restricted sets of seeds than [13].…”
Section: Related Work (mentioning)
confidence: 89%
“…These include the minimum distance between seeds, and the maximum number of seeds to explore per overlap. A discussion of these settings in relation to alignment accuracy versus computational cost is presented in the BELLA analysis [13]. In general, increasing the number of seeds to explore per overlap increases computational cost of the alignment stage (not necessarily linearly), depending on the pairwise alignment kernel employed.…”
Section: Overlap (mentioning)
confidence: 99%
“…SpGEMM is a relatively unknown primitive in genomics. Most notably, Besta et al [3] used SpGEMM to compute similarity between genomes in distributed memory, after the appearance of our preprint [15].…”
Section: Related Work (mentioning)
confidence: 99%
“…The threshold t = 2 kb is derived from the procedure proposed by Heng Li [19] and the ground truth is generated using Minimap2. A description of our evaluation procedure and ground truth generation can be found in the supplementary material of our preprint [15].…”
Section: Experimental Setting (mentioning)
confidence: 99%