Leveraging Multicore Cluster Nodes by Adding OpenMP to Flow Solvers Parallelized with MPI

Iwainsky, Christian; Sarholz, Samuel; Mey, Dieter an; Altenfeld, Ralph

doi:10.1007/978-3-642-12659-8_5

Cited by 1 publication

(2 citation statements)

References 2 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…As a pure (dynamic) MPI parallelisation can suffer from communication overhead and high memory consumption (Rabenseifner and Wellein, 2003; Rabenseifner et al, 2009), our MPI-only version is enhanced with Pthreads to exploit intra-node shared-memory parallelism for a hybrid parallel program. Hybrid parallelisation has gained benefit in several applications, for example Rabenseifner et al (2009), Howison et al (2010) and Iwainsky et al (2010). As a first approach, we added the multi-threading layer on the same parallelisation level as for MPI and let each worker process distribute its work across corresponding threads with a static schedule.…”

Section: Functionality Of the Softwarementioning

confidence: 99%

“…To utilize these architectures, message passing is employed for inter- and intra-node communication in most scientific applications. Aiming at further scalability, the exploration of hybrid parallelism by adding a multi-threading layer can be advantageous (Balaji et al, 2009; Iwainsky et al, 2010). However, the required high scalability is only achievable with a sufficient load balancing that necessitates a careful analysis of the algorithm and the different workloads.…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Towards an accurate simulation of the crystallisation process in injection moulded plastic components by hybrid parallelisation

Wienke

Spekowius

Dammer

et al. 2013

The International Journal of High Performance Computing Applica

View full text Add to dashboard Cite

The simulation of the crystallisation process during the injection moulding process of plastic components is time consuming, resulting in the ability to simulate only small parts of a component. To remove this constraint and enable the simulation of complex parts, the computing power of high-performance computers is demanded. A further design objective is high scalability in performance and memory consumption on today’s and future high-performance computing architectures to allow precise predictions of global part properties. In this work, we present a simulation tool for the crystallisation process and the parallelisation of the tool by a hybrid MPI-Pthreads approach that meets this design objective. We verify the performance and memory consumption of our parallelisation using a large simulation area of a realistic plastic component as a case study and can further predict that entire parts will also be calculable.

show abstract