2014 IEEE 4th International Conference on Computational Advances in Bio and Medical Sciences (ICCABS) 2014
DOI: 10.1109/iccabs.2014.6863917
|View full text |Cite
|
Sign up to set email alerts
|

SAIS-OPT: On the characterization and optimization of the SA-IS algorithm for suffix array construction

Abstract: Abstract-The suffix array and Burrows-Wheeler Transform are critical index structures in next generation sequence analysis. The construction of such index structures for mammalian-sized genomes can take thousands of seconds (i.e. tens of minutes). Its construction is complicated by computational overheads that coming from irregular or complex memory-access patterns. This paper rigorously characterizes the execution profile of the SA-IS algorithm in order to guide its optimization. The resulting optimized SA-IS… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
7
0

Year Published

2015
2015
2020
2020

Publication Types

Select...
2
2
1

Relationship

0
5

Authors

Journals

citations
Cited by 7 publications
(7 citation statements)
references
References 18 publications
0
7
0
Order By: Relevance
“…As mentioned above, Com-Zip also keeps up with the optimal algorithms of the 3 classes, but each time we cannot focus on too many points. In [6], we have compared the time complexities of traditional BWT, BWT with SA-IS [11], BWT with GSACA [12], and CZ-BWT, which are current BWT studies.…”
Section: Journal Of Sensorsmentioning
confidence: 99%
“…As mentioned above, Com-Zip also keeps up with the optimal algorithms of the 3 classes, but each time we cannot focus on too many points. In [6], we have compared the time complexities of traditional BWT, BWT with SA-IS [11], BWT with GSACA [12], and CZ-BWT, which are current BWT studies.…”
Section: Journal Of Sensorsmentioning
confidence: 99%
“…SA-IS requires special ending symbol for the block and recursive reduction to a shorter string [2,13], which are more complex than CZ-BWT. SA-IS needs to scan the BWT block for more than 3 times, while according to Algorithm 1, CZ-BWT encoding just scan the block twice in phases 1 and 2.…”
Section: Time Complexitiesmentioning
confidence: 99%
“…Quite a few recent advancements of BWT algorithms are driven by the rapid development of genome information technologies [2,3,13,18], and there are many DNA softwares using BWT, including DNA compression, alignment, sequencing, and indexing. Due to the difference between the DNA and common data charsets, we cannot proceed direct experiments to compare a DNA software such as BWA (Burrows-Wheeler aligner) with a universal compression software such as ComZip or bzip2, but we can analyze their BWT algorithms to investigate their advantages and shortcomings.…”
Section: Analyses Of the Cz-bwt Algorithmmentioning
confidence: 99%
See 2 more Smart Citations