2013 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology (CIBCB) 2013
DOI: 10.1109/cibcb.2013.6595386
|View full text |Cite
|
Sign up to set email alerts
|

Managing memory and reducing I/O cost for correlation matrix calculation in bioinformatics

Abstract: Abstract-The generation of a correlation matrix from a large set of long gene sequences is a common requirement in many bioinformatics problems such as phylogenetic analysis. The generation is not only computationally intensive but also requires significant memory resources as, typically, few gene sequences can be simultaneously stored in primary memory. The standard practice in such computation is to use frequent input/output (I/O) operations. Therefore, minimizing the number of these operations will yield mu… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2014
2014
2017
2017

Publication Types

Select...
2
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 13 publications
0
2
0
Order By: Relevance
“…Its computation has been recently investigated for single computer platforms [17,18]. It is re-programmed for our experiments in this paper by using the Application Programming Interfaces (APIs) provided in our distributed computing framework.…”
Section: Performance Of the Distributed Computing Frameworkmentioning
confidence: 99%
See 1 more Smart Citation
“…Its computation has been recently investigated for single computer platforms [17,18]. It is re-programmed for our experiments in this paper by using the Application Programming Interfaces (APIs) provided in our distributed computing framework.…”
Section: Performance Of the Distributed Computing Frameworkmentioning
confidence: 99%
“…All-to-All Comparisons are a key calculation stage in Multiple Sequence Alignment (MSA) and also in studying phylogenetic diversity in protein families [16]. In general, big data processing in these problems include calculation of a cross-similarity matrix between each pair of the data sequences [17,18]. This calculation is followed by several data grouping stages.…”
Section: Related Work and Motivationsmentioning
confidence: 99%