2009
DOI: 10.1186/1756-0500-2-73
|View full text |Cite
|
Sign up to set email alerts
|

CUDASW++: optimizing Smith-Waterman sequence database searches for CUDA-enabled graphics processing units

Abstract: BackgroundThe Smith-Waterman algorithm is one of the most widely used tools for searching biological sequence databases due to its high sensitivity. Unfortunately, the Smith-Waterman algorithm is computationally demanding, which is further compounded by the exponential growth of sequence databases. The recent emergence of many-core architectures, and their associated programming interfaces, provides an opportunity to accelerate sequence database searches using commonly available and inexpensive hardware.Findin… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
183
0
7

Year Published

2010
2010
2012
2012

Publication Types

Select...
3
3
1

Relationship

0
7

Authors

Journals

citations
Cited by 224 publications
(191 citation statements)
references
References 21 publications
1
183
0
7
Order By: Relevance
“…Several alternative implementations for accelerating the Smith-Waterman algorithm using FPGAs ( [14], [15]), vec- tor operations on x86 CPUs [16], and GPUs (e.g., using CUDA [17]) exist. However, because the PaPaRa alignment kernel differs significantly from the standard SmithWaterman implementation, we omit a more detailed review at this point.…”
Section: Related Workmentioning
confidence: 99%
“…Several alternative implementations for accelerating the Smith-Waterman algorithm using FPGAs ( [14], [15]), vec- tor operations on x86 CPUs [16], and GPUs (e.g., using CUDA [17]) exist. However, because the PaPaRa alignment kernel differs significantly from the standard SmithWaterman implementation, we omit a more detailed review at this point.…”
Section: Related Workmentioning
confidence: 99%
“…Device Database Performance searched (Liu, Schmidt, Voss, Schroder & Muller-Wittig, 2006) GTX 7800 Swiss-Prot 650 MCUPS (Liu, Huang, Johnson & Vaidya, 2006) GTX 7800 983 protein 241 MCUPS sequences (Manavski & Valle, 2008) GTX 8800 Swiss-Prot 1.9 GCUPS (Liu et al, 2009) GTX 280 Swiss-Prot 9.5 GCUPS (Liu et al, 2010) GTX 280 Swiss-Prot 17 GCUPS (Kentie, 2010) GTX 275 Swiss-Prot 21.4 GCUPS …”
Section: Methodsmentioning
confidence: 99%
“…Furthermore, it is shown to scale almost linearly with the amount of GPUs used by simply splitting up the database. Various improvements have been suggested to the approach presented in (Manavski & Valle, 2008), as shown in (Akoglu & Striemer, 2009;Liu et al, 2009). In (Liu et al, 2009), for sequences of more than 3,072 amino acids an 'inter-task parallelization' method similar to the systolic array and OpenGL approaches is used as this, while slower, requires less memory.…”
Section: Current Implementationsmentioning
confidence: 99%
See 2 more Smart Citations