2016
DOI: 10.1186/s12859-016-1302-4
|View full text |Cite
|
Sign up to set email alerts
|

muBLASTP: database-indexed protein sequence search on multicore CPUs

Abstract: BackgroundThe Basic Local Alignment Search Tool (BLAST) is a fundamental program in the life sciences that searches databases for sequences that are most similar to a query sequence. Currently, the BLAST algorithm utilizes a query-indexed approach. Although many approaches suggest that sequence search with a database index can achieve much higher throughput (e.g., BLAT, SSAHA, and CAFE), they cannot deliver the same level of sensitivity as the query-indexed BLAST, i.e., NCBI BLAST, or they can only support nuc… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
6
0

Year Published

2017
2017
2021
2021

Publication Types

Select...
4
2

Relationship

2
4

Authors

Journals

citations
Cited by 11 publications
(6 citation statements)
references
References 25 publications
0
6
0
Order By: Relevance
“…This process is used because alignments of interest are those whose score is greater than or equal to a minimum, and thus looking for such matches is similar to looking for them in an index. The techniques used to represent sequences and to create indexes depend on each tool used, but the most precise ones (like MegaBLAST) divide every sequence into k-mers, and store them in some type of index [6,38].…”
Section: General Remarksmentioning
confidence: 99%
“…This process is used because alignments of interest are those whose score is greater than or equal to a minimum, and thus looking for such matches is similar to looking for them in an index. The techniques used to represent sequences and to create indexes depend on each tool used, but the most precise ones (like MegaBLAST) divide every sequence into k-mers, and store them in some type of index [6,38].…”
Section: General Remarksmentioning
confidence: 99%
“…Given expanding amount of data, providing fast and biologically valuable sequence alignment tools via high-performance computing (HPC) and algorithmic innovations has been a highly active area of bioinformatics research, particularly in the context of rapidly expanding databases. For example, several sequence alignment programs have relied on contributing algorithmic improvements (e.g., HMMER [4], DIAMOND [5], CaBLAST [6]) while others have focused on improving parallelization to take advantage of emerging high-performance computing (HPC) platforms and programming paradigms (e.g., cuBLASTP [7], muBLASTP [8], mpiBLAST [9], SparkBLAST [10], and SparkLeBLAST [11]). Both DIAMOND [5] and CaBLAST [6] improve the execution time of sequence alignment by compressing the sequence database.…”
Section: Introductionmentioning
confidence: 99%
“…If single-threaded algorithms can be changed to multi-threaded algorithms, it will be extremely improve the calculation speed. For example, the single-threaded algorithm of sequence comparison has been changed to the multi-threaded algorithm (M et al, 2017), and the algorithm of protein sequence search has been changed to the multi-threaded algorithm (Zhang et al, 2016).…”
Section: Introductionmentioning
confidence: 99%