2015 International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation (SAMOS) 2015
DOI: 10.1109/samos.2015.7363679
|View full text |Cite
|
Sign up to set email alerts
|

An FPGA-based systolic array to accelerate the BWA-MEM genomic mapping algorithm

Abstract: Abstract-We present the first accelerated implementation of BWA-MEM, a popular genome sequence alignment algorithm widely used in next generation sequencing genomics pipelines. The Smith-Waterman-like sequence alignment kernel requires a significant portion of overall execution time. We propose and evaluate a number of FPGA-based systolic array architectures, presenting optimizations generally applicable to variable length Smith-Waterman execution. Our kernel implementation is up to 3x faster, compared to soft… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
46
0

Year Published

2016
2016
2019
2019

Publication Types

Select...
3
2
2

Relationship

2
5

Authors

Journals

citations
Cited by 49 publications
(46 citation statements)
references
References 11 publications
0
46
0
Order By: Relevance
“…With a 16-PE accelerator engine, the seeds generation is accelerated by 4×, and the overall SMEM seeding stage by 26% when compared with 16-thread CPU execution † . Houtgast et al [78], [79] implement a hardware aligner based on BWA-MEM as well and the design is composed of a systolic array architecture to accelerate seed extension kernel with Smith-Waterman. By offloading the computational bottleneck onto Virtex-7 XC7VX690T-2 FPGA, the entire system can deliver a total acceleration of about 45%.…”
Section: Mappingmentioning
confidence: 99%
“…With a 16-PE accelerator engine, the seeds generation is accelerated by 4×, and the overall SMEM seeding stage by 26% when compared with 16-thread CPU execution † . Houtgast et al [78], [79] implement a hardware aligner based on BWA-MEM as well and the design is composed of a systolic array architecture to accelerate seed extension kernel with Smith-Waterman. By offloading the computational bottleneck onto Virtex-7 XC7VX690T-2 FPGA, the entire system can deliver a total acceleration of about 45%.…”
Section: Mappingmentioning
confidence: 99%
“…To our knowledge the only application-level accelerated integrated implementations of BWA-MEM that exist are: an FPGA-accelerated implementation of the Seed Extension phase [15] achieving a 1.5x speedup, further improved in [16] for an overall 2.6x speedup; and a GPU implementation [9], further improved to achieve an up to 2x speedup [17]. The FPGA implementation used here builds on [15], and a comparison of the implementation here is made to the improved GPU implementation.…”
Section: Related Workmentioning
confidence: 99%
“…The FPGA implementation used here builds on [15], and a comparison of the implementation here is made to the improved GPU implementation. This paper focuses on powerefficiency, besides overall application performance, as for many scenarios, such as processing in a large scale data center, this is at least as important as absolute performance.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations