2009
DOI: 10.18637/jss.v031.i01
|View full text |Cite
|
Sign up to set email alerts
|

State of the Art in Parallel Computing withR

Abstract: R is a mature open-source programming language for statistical computing and graphics. Many areas of statistical research are experiencing rapid growth in the size of data sets. Methodological advances drive increased use of simulations. A common approach is to use parallel computing. This paper presents an overview of techniques for parallel computing with R on computer clusters, on multi-core systems, and in grid computing. It reviews sixteen different packages, comparing them on their state of development, … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

2
80
0

Year Published

2010
2010
2017
2017

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 106 publications
(82 citation statements)
references
References 26 publications
2
80
0
Order By: Relevance
“…Two main architectural approaches are present for the processing of large amounts of statistical data: scaling the statistical tool or scaling the data management system [39]. Systems in the first category tend to stick to a specific tool and language and provide a scalable infrastructure for it.…”
Section: Scaling In Statistical Data Processingmentioning
confidence: 99%
“…Two main architectural approaches are present for the processing of large amounts of statistical data: scaling the statistical tool or scaling the data management system [39]. Systems in the first category tend to stick to a specific tool and language and provide a scalable infrastructure for it.…”
Section: Scaling In Statistical Data Processingmentioning
confidence: 99%
“…This approach is way faster than a for loop, where each single iteration only starts when the previous one has been completed. The key point to parallel computing is that the pieces of computations are unrelated (which is the case of some, but not all, loops); examples are evaluating the same R function or fitting the same model on different datasets, which can be particularly useful in simulation studies [34]. The basic approach sets up M different so-called 'worker' processes and splits the main task in M subtasks, which are sent to the workers by a 'master' process.…”
Section: Computationsmentioning
confidence: 99%
“…For parallel execution, the foreach package has been adapting the R parallel packages doMC 18 (based on the package multicore on single workstations), doSNOW 19 (based on the package snow with SOCKETS), and doMPI 20 (based on the package Rmpi).…”
Section: Foreach: Foreach Looping Construct For Rmentioning
confidence: 99%
“…18 http://cran.r-project.org/web/packages/doMC/index.html 19 http://cran.r-project.org/web/packages/doSNOW/index.html 20 http://cran.r-project.org/web/packages/doMPI/index.html 21 Luke Tierney is the maintainer of the R package snow.…”
Section: Parallel: Parallel R Packagementioning
confidence: 99%