2018 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) 2018
DOI: 10.1109/bibm.2018.8621546
|View full text |Cite
|
Sign up to set email alerts
|

SORA: Scalable Overlap-graph Reduction Algorithms for Genome Assembly using Apache Spark in the Cloud

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
2
2

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(4 citation statements)
references
References 19 publications
0
4
0
Order By: Relevance
“…The experiments are divided into two groups: (a) parallel performance and scalability of diBELLA 2D and (b) performance comparison with related work. In the latter case, we compare the overlap detection of diBELLA 2D with diBELLA 1D [9] and minimap2 [16] written in C for shared memory, while we compare our transitive reduction algorithm with SORA [23] written in Scala on top of Apache Spark [24].…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…The experiments are divided into two groups: (a) parallel performance and scalability of diBELLA 2D and (b) performance comparison with related work. In the latter case, we compare the overlap detection of diBELLA 2D with diBELLA 1D [9] and minimap2 [16] written in C for shared memory, while we compare our transitive reduction algorithm with SORA [23] written in Scala on top of Apache Spark [24].…”
Section: Methodsmentioning
confidence: 99%
“…SORA [23] computes transitive reduction of a string graph based on an overlap graph in distributed memory using Apache Spark [24] and the GraphX library [25], which allows parallel computation on distributed graphs in Spark. To the best of our knowledge, SORA is the only other distributed algorithm that computes transitive reduction on overlap graphs, although it was designed for cloud environments.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…It runs on a spark platform achieving 13X, speeding up for de novo genome assembly faster than the traditional platform. A recent similar study was introduced by [28]. The IBD algorithm addressed pairwise comparison problems that can be used to overlap two reads or align reads against k-mer of the reference genome [29].…”
Section: Related Workmentioning
confidence: 99%