2008
DOI: 10.1186/1471-2105-9-558
|View full text |Cite
|
Sign up to set email alerts
|

SPRINT: A new parallel framework for R

Abstract: BackgroundMicroarray analysis allows the simultaneous measurement of thousands to millions of genes or sequences across tens to thousands of different samples. The analysis of the resulting data tests the limits of existing bioinformatics computing infrastructure. A solution to this issue is to use High Performance Computing (HPC) systems, which contain many processors and more memory than desktop computer systems. Many biostatisticians use R to process the data gleaned from microarray analysis and there is ev… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
27
0

Year Published

2011
2011
2014
2014

Publication Types

Select...
4
2
1

Relationship

2
5

Authors

Journals

citations
Cited by 25 publications
(27 citation statements)
references
References 12 publications
0
27
0
Order By: Relevance
“…Parallelisation libraries for R language, besides Rmpi, are SPRINT [24] and pR [25] packages, whose their main advantage is that they require very little modification to the existing sequential R scripts and no expertise in parallel computing; however, the master worker suffers from communication overhead, and the authors recognise that their approach may not yield the optimal schedule [25]. Other parallelisation libraries are snow and nws that provide coordination and parallel execution facilities.…”
Section: Related Workmentioning
confidence: 99%
“…Parallelisation libraries for R language, besides Rmpi, are SPRINT [24] and pR [25] packages, whose their main advantage is that they require very little modification to the existing sequential R scripts and no expertise in parallel computing; however, the master worker suffers from communication overhead, and the authors recognise that their approach may not yield the optimal schedule [25]. Other parallelisation libraries are snow and nws that provide coordination and parallel execution facilities.…”
Section: Related Workmentioning
confidence: 99%
“…Adding a new function involves writing the parallel algorithm (typically in C or Fortran) and an interface function in R [7]. This approach makes for much better ease of use by bioinformaticians: in most cases, their existing analysis scripts will need only minor changes.…”
Section: Exploiting Parallelism In Rmentioning
confidence: 99%
“…The SPRINT [7] project aims to implement a library that biostatisticians can use to exploit HPC systems while requiring only minimal changes to their existing analysis workflows. SPRINT provides drop-in replacements for a number of computationally expensive R functions that were identified as important in a user requirements survey of the bioinformatics community [8].…”
Section: Introductionmentioning
confidence: 99%
“…Parallel All-to-All Comparisons of genome sequences are considered by Hill et al [8]. The main focus of this work is on load balancing among the clusters by dividing the comparison matrix into rows and assigning these rows to different nodes dynamically.…”
Section: Related Work and Motivationsmentioning
confidence: 99%
“…There have been several efforts to address All-to-All Comparison Problems [4,5,6,7,8,9]. A main approach is to provide solutions for special-purpose All-to-All Comparison Problems only.…”
Section: Introductionmentioning
confidence: 99%