Sustained Simulation Performance 2018 and 2019 2020
DOI: 10.1007/978-3-030-39181-2_9
|View full text |Cite
|
Sign up to set email alerts
|

VEO and PyVEO: Vector Engine Offloading for the NEC SX-Aurora Tsubasa

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
2
2
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(3 citation statements)
references
References 4 publications
0
3
0
Order By: Relevance
“…Vector Engine Offloading (VEO) [8] is a programming model which executes the main program on the VH and offloads kernels to the VEs. While the API somewhat resembles OpenCL, it differs from it due to the SIMD/vector nature of kernels, and due to their ability to execute almost any Linux system call.…”
Section: Background 1sx-aurora Vector Enginementioning
confidence: 99%
“…Vector Engine Offloading (VEO) [8] is a programming model which executes the main program on the VH and offloads kernels to the VEs. While the API somewhat resembles OpenCL, it differs from it due to the SIMD/vector nature of kernels, and due to their ability to execute almost any Linux system call.…”
Section: Background 1sx-aurora Vector Enginementioning
confidence: 99%
“…In LLVM this is handled by the libomptarget library, which selects at runtime a target device for the execution of the offloaded code. We developed a corresponding plugin based on the NEC VE Offloading (VEO [15]) interface. However, our current implementation benefits from the new AVEO [16] implementation, which shows a better performance compared to our original library.…”
Section: Openmp Target Device Offloading Designsmentioning
confidence: 99%
“…Besides OpenMP target device offloading, other approaches exist in order to execute computeintensive code parts on a SX-Aurora TSUBASA vector engine. The direct use of the low-level APIs VEO [15], AVEO [16] or VHCall [3] gives the programmer full control of the data transfers and the kernel execution. Noack et al [23] built on top of the portable Heterogeneous Active Messages (HAM) a high-level C++-only framework for SX-Aurora TSUBASA offloading.…”
Section: Related Workmentioning
confidence: 99%