2010
DOI: 10.1007/978-3-642-12659-8_5
|View full text |Cite
|
Sign up to set email alerts
|

Leveraging Multicore Cluster Nodes by Adding OpenMP to Flow Solvers Parallelized with MPI

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2013
2013
2013
2013

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(2 citation statements)
references
References 2 publications
0
2
0
Order By: Relevance
“…As a pure (dynamic) MPI parallelisation can suffer from communication overhead and high memory consumption (Rabenseifner and Wellein, 2003; Rabenseifner et al, 2009), our MPI-only version is enhanced with Pthreads to exploit intra-node shared-memory parallelism for a hybrid parallel program. Hybrid parallelisation has gained benefit in several applications, for example Rabenseifner et al (2009), Howison et al (2010) and Iwainsky et al (2010). As a first approach, we added the multi-threading layer on the same parallelisation level as for MPI and let each worker process distribute its work across corresponding threads with a static schedule.…”
Section: Functionality Of the Softwarementioning
confidence: 99%
See 1 more Smart Citation
“…As a pure (dynamic) MPI parallelisation can suffer from communication overhead and high memory consumption (Rabenseifner and Wellein, 2003; Rabenseifner et al, 2009), our MPI-only version is enhanced with Pthreads to exploit intra-node shared-memory parallelism for a hybrid parallel program. Hybrid parallelisation has gained benefit in several applications, for example Rabenseifner et al (2009), Howison et al (2010) and Iwainsky et al (2010). As a first approach, we added the multi-threading layer on the same parallelisation level as for MPI and let each worker process distribute its work across corresponding threads with a static schedule.…”
Section: Functionality Of the Softwarementioning
confidence: 99%
“…To utilize these architectures, message passing is employed for inter- and intra-node communication in most scientific applications. Aiming at further scalability, the exploration of hybrid parallelism by adding a multi-threading layer can be advantageous (Balaji et al, 2009; Iwainsky et al, 2010). However, the required high scalability is only achievable with a sufficient load balancing that necessitates a careful analysis of the algorithm and the different workloads.…”
Section: Introductionmentioning
confidence: 99%