Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
231
0

Year Published

2017
2017
2020
2020

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 349 publications
(231 citation statements)
references
References 8 publications
0
231
0
Order By: Relevance
“…These delayed processes were observed on three different HPC systems and on different days, so they are unlikely to be infrastructure specific. In order to rule out the hypothesis that Dask is inherently limited in its applicability to our problem we re-implemented our mapreduce problem with MPI based on the Python mpi4py [DPS05], [DPKC11] module. The comparison was performed with the XTC600x trajectory on SDSC Comet.…”
Section: Performance Optimizationmentioning
confidence: 99%
See 1 more Smart Citation
“…These delayed processes were observed on three different HPC systems and on different days, so they are unlikely to be infrastructure specific. In order to rule out the hypothesis that Dask is inherently limited in its applicability to our problem we re-implemented our mapreduce problem with MPI based on the Python mpi4py [DPS05], [DPKC11] module. The comparison was performed with the XTC600x trajectory on SDSC Comet.…”
Section: Performance Optimizationmentioning
confidence: 99%
“…MDAnalysis does not yet provide a standard interface for parallel analysis; instead, various existing parallel libraries such as Python multiprocessing, joblib, and mpi4py [DPS05], [DPKC11] are currently used to parallelize MDAnalysis-based code on a case-by-case basis. Here we evaluated performance for parallel map-reduce [DG08] type analysis with the Dask parallel computing library [Roc15] for task-graph based distributed computing on HPC and local computing resources.…”
Section: Introductionmentioning
confidence: 99%
“…On the other hand, the Parallel Python (PP) module [3] provides mechanisms for parallel execution of Python codes, with an API with specific functions to specify the number of workers to be used, submit the jobs for execution, get the results from the workers, etc. Finally, the mpi4py [20] library allows the user to open parallelism both inter-node and intra-node. In all cases, the management of the parallelism is the programmer's responsibility.…”
Section: Motivationmentioning
confidence: 99%
“…• Serial processing: SerialPool • Python standard-library multiprocessing: MultiPool • OpenMPI (Gabriel et al 2004) and mpich2 (Lusk, Doss, and Skjellum 1996) via the mpi4py package (Dalcín, Paz, and Storti 2005;Dalcín et al 2008): MPIPool • joblib: JoblibPool All pool classes provide a .map() method to distribute tasks to a specified worker function (or callable), and support specifying a callback function that is executed on the master process to enable post-processing or caching the results as they are delivered.…”
mentioning
confidence: 99%