OP2: An active library framework for solving unstructured mesh-based applications on multi-core and many-core architectures

Mudalige, Gihan R.; Giles, Michael B.; Reguly, István Z.; Bertolli, Carlo; Kelly, Paul H. J.

doi:10.1109/inpar.2012.6339594

Cited by 78 publications

(72 citation statements)

References 16 publications


“…With such an explicit access-descriptor, OP2 allows for optimization and parallel programming experts to choose significantly more radical implementations for very specific hardware in order to gain near-optimal performance. This paper documents a number of significant developments in the design of OP2's heterogeneous back-ends and their performance extending our previous work in [36]: (1) A major contribution is the development of OP2's MPI+OpenMP back-end design and performance which augments the MPI only and MPI+CUDA implementations. This new back-end provides key insights into the performance limiting factors of modern multi-core clusters, particularly demonstrating the issues encountered on NUMA type architectures of multi-core nodes.…”

Section: Related Work (supporting)

confidence: 59%

Design and initial performance of a high-level unstructured mesh framework on heterogeneous parallel systems

et al. 2013

Self Cite


“…Later in [13], [14], [16], crucial efforts of evaluating the thread-level performance potentials of PETSc-FUN3D on wide spectrum of architectures are presented. On the other hand, SU2 code of Stanford [59] and OP2 code of Oxford [60] are considered to be the state-of-the-practice unstructured CFD research codes, which both have recently been ported into many emerging HPC architectures [61], [62].…”

Section: Unstructured Aerodynamics Computations (mentioning)

confidence: 99%

Optimizations of Unstructured Aerodynamics Computations for Many-core Architectures

Farhan

Keyes

2018

IEEE Trans. Parallel Distrib. Syst.


“…The OP2 library (Mudalige et al (2012)) is a domain specific language embedded in C and Fortran that allows unstructured mesh algorithms to be expressed at a high level, and provides automatic parallelisation and a number of other features. It provides an abstraction that lets the domain scientist describe a mesh using a number of sets (such as quadrilaterals or vertices), connections between these sets (such as edges-to-nodes), and data defined on sets (such as x, y coordinates on vertices).…”

Section: The OP2 Domain Specific Language (mentioning)

confidence: 99%
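The sets, connections, and data described in the statement above can be illustrated with a minimal sketch. It does not use OP2 or its API: the names `MeshSet`, `MeshMap`, `MeshData`, `loop_over_set`, and `edge_length` are hypothetical, chosen only for this illustration, and the loop runs sequentially rather than in parallel.

```python
import math
from dataclasses import dataclass
from typing import Callable, List, Sequence


@dataclass
class MeshSet:
    """A named collection of mesh elements (hypothetical, not OP2's type)."""
    name: str
    size: int


@dataclass
class MeshMap:
    """A connection from each source element to `arity` target elements."""
    source: MeshSet
    target: MeshSet
    arity: int
    indices: List[int]  # flat list of length source.size * arity

    def __post_init__(self) -> None:
        if len(self.indices) != self.source.size * self.arity:
            raise ValueError("map length must be source.size * arity")
        if any(i < 0 or i >= self.target.size for i in self.indices):
            raise ValueError("map index outside the target set")


@dataclass
class MeshData:
    """`dim` values stored for every element of a set."""
    on: MeshSet
    dim: int
    values: List[float]  # flat list of length on.size * dim

    def __post_init__(self) -> None:
        if len(self.values) != self.on.size * self.dim:
            raise ValueError("data length must be on.size * dim")

    def element(self, i: int) -> Sequence[float]:
        return self.values[i * self.dim:(i + 1) * self.dim]


def loop_over_set(kernel: Callable[..., float],
                  mesh_map: MeshMap,
                  data: MeshData) -> List[float]:
    """Apply `kernel` once per source element, reading `data` through the map."""
    if data.on is not mesh_map.target:
        raise ValueError("data must live on the map's target set")
    results = []
    for e in range(mesh_map.source.size):
        targets = mesh_map.indices[e * mesh_map.arity:(e + 1) * mesh_map.arity]
        results.append(kernel(*(data.element(t) for t in targets)))
    return results


def edge_length(a: Sequence[float], b: Sequence[float]) -> float:
    """Per-edge kernel: Euclidean distance between the edge's two vertices."""
    return math.hypot(b[0] - a[0], b[1] - a[1])


# A unit square with one diagonal: 4 vertices, 5 edges.
vertices = MeshSet("vertices", 4)
edges = MeshSet("edges", 5)
edge_to_vertex = MeshMap(edges, vertices, 2, [0, 1, 1, 2, 2, 3, 3, 0, 0, 2])
coords = MeshData(vertices, 2, [0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 1.0])

edge_lengths = loop_over_set(edge_length, edge_to_vertex, coords)
```

The kernel only sees the values of one edge's vertices, while the loop decides how elements are visited. Keeping those two concerns apart is what lets a framework substitute a different execution strategy without touching the kernel.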

“…OP2, by Mudalige et al (2012), is such a DSL, embedded in C/C++ and Fortran; it has been in development since 2009: it provides an abstraction for expressing unstructured mesh computations at a high-level, and then provides automated tools to translate scientific code written once, into a range of high-performance implementations targeting multi-core CPUs, GPUs, and large heterogeneous supercomputers. The original VOLNA model (Dutykh et al (2011)) was already discussed and validated in detail and was used in production for small-scale experiments and modelling, but was inadequate for targeting large-scale scenarios and statistical analysis, therefore it was re-implemented on top of OP2; this paper describes the process, challenges and results from that work.…”

mentioning

confidence: 99%

The VOLNA-OP2 Tsunami Code (Version 1.0)

Reguly

Gopinathan

Beck

et al. 2018

Preprint


Abstract. In this paper, we present the VOLNA-OP2 tsunami model and implementation; a finite volume non-linear shallow water equations (NSWE) solver built on the OP2 domain specific language for unstructured mesh computations. VOLNA-OP2 is unique among tsunami solvers in its support for several high performance computing platforms: CPUs, the Intel Xeon Phi, and GPUs. This is achieved in a way that the scientific code is kept separate from various parallel implementations, enabling easy maintainability. It has already been used in production for several years, here we discuss how it can be integrated into various workflows, such as a statistical emulator. The scalability of the code is demonstrated on three supercomputers, built with classical Xeon CPUs, the Intel Xeon Phi, and NVIDIA P100 GPUs. VOLNA-OP2 shows an ability to deliver productivity to its users, as well as performance and portability on a number of platforms.
