30th IEEE International Performance Computing and Communications Conference 2011
DOI: 10.1109/pccc.2011.6108104
|View full text |Cite
|
Sign up to set email alerts
|

CUDA acceleration of P7Viterbi algorithm in HMMER 3.0

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2012
2012
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 7 publications
(2 citation statements)
references
References 2 publications
0
2
0
Order By: Relevance
“…The main loop of this function is amenable to vectorization as it exhibits a high degree of data parallelism [42]. Modern ARM processors support the NEON instruction set extension which operate on four scalar values at a time.…”
Section: A Auto-vectorizationmentioning
confidence: 99%
“…The main loop of this function is amenable to vectorization as it exhibits a high degree of data parallelism [42]. Modern ARM processors support the NEON instruction set extension which operate on four scalar values at a time.…”
Section: A Auto-vectorizationmentioning
confidence: 99%
“…Different sequences were assigned to individual threads in both methods. Partial optimization was proposed in [ 19 ], which parallelizes the P7Viterbi part without considering the D - D path dependency. Although this approach claims a 14x speedup than original functions, it sacrifices the sensitivity of probabilistic inference.…”
Section: Introductionmentioning
confidence: 99%