2022
DOI: 10.1109/access.2022.3182714
|View full text |Cite
|
Sign up to set email alerts
|

Accelerating Edit-Distance Sequence Alignment on GPU Using the Wavefront Algorithm

Abstract: Sequence alignment remains a fundamental problem with practical applications ranging from pattern recognition to computational biology. Traditional algorithms based on dynamic programming are hard to parallelize, require significant amounts of memory, and fail to scale for large inputs. This work presents eWFA-GPU, a GPU (graphics processing unit)-accelerated tool to compute the exact editdistance sequence alignment based on the wavefront alignment algorithm (WFA). This approach exploits the similarities betwe… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
0
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
3

Relationship

1
6

Authors

Journals

citations
Cited by 12 publications
(5 citation statements)
references
References 70 publications
0
0
0
Order By: Relevance
“…The extend pyramid Ê is the pyramidal form of the extend table 𝐸 as defined in Eq. (8), which contains the length of the longest common prefixes of P and 𝑇 starting at positions 𝑖 and 𝑗 respectively.…”
Section: 𝐷 𝑖𝑗 = D𝑗−𝑖[𝑖>𝑗]𝑗+[𝑖≤𝑗]𝑖mentioning
confidence: 99%
See 2 more Smart Citations
“…The extend pyramid Ê is the pyramidal form of the extend table 𝐸 as defined in Eq. (8), which contains the length of the longest common prefixes of P and 𝑇 starting at positions 𝑖 and 𝑗 respectively.…”
Section: 𝐷 𝑖𝑗 = D𝑗−𝑖[𝑖>𝑗]𝑗+[𝑖≤𝑗]𝑖mentioning
confidence: 99%
“…When working with short strings, the workload is relatively small compared to the large number of processing elements (PEs) available on a GPU, resulting in low GPU occupancy. To address this, it is common practice to group a large number of short string pairs and process them in parallel, as described in [8]. However, when dealing with large strings, even with a pyramidal data structure, the amount of work D. Castells-Rufas Fig.…”
Section: 𝛴|mentioning
confidence: 99%
See 1 more Smart Citation
“…This particular property of DNA enables it to use in various fields [1]. DNA sequence matching is an important application used in forensics for identification purposes and in medicine and agriculture for exploring possible disease and abnormality diagnoses [2], [3]. A DNA sequence is composed of four letters: A, C, G, and T, as shown in Figure 1.…”
Section: Introductionmentioning
confidence: 99%
“…Some sequence alignment methods also use hardware-based or data-centric approaches that use special hardware or specific data structures such as FM-Index for faster processing [12], [13], [14]. However, most of the methods depend on algorithm-centric approaches such as edit distance, estimation algorithms, dynamic programming, etc., for fast processing [2], [15]. In the dynamic programming-based approach, a larger problem is divided into smaller independent sub-problems, and these sub-problems are solved individually [16].…”
Section: Introductionmentioning
confidence: 99%