2018 IEEE International Conference on Cluster Computing (CLUSTER) 2018
DOI: 10.1109/cluster.2018.00021

OpenACC vs the Native Programming on Sunway TaihuLight: A Case Study with GTC-P

Cited by 9 publications (2 citation statements)
References 12 publications
“…But the problem of threads' write conflict on sunway CPEs must be solved before involving this trick. When we assign computation tasks of one MPI processor to 64 CPE threads (In sunway native programming, one CPE can launch a light-weight working thread using Athread library [33]. ), spatial decomposition of simulation domain is achieved.…”
Section: Tasks Assignment on CPEs (mentioning)
confidence: 99%
“…Currently, to port applications to these systems and achieve good performance requires quite a lot of efforts. One reason is that "automatic" parallelization through, for example, OpenACC results in rather poor performances in the case of TaihuLight (Cai et al, 2018), and currently only a subset of OpenCL is available on PEZY-SC2. In the case of PEZY-SC2, right now the host CPU on chip is disabled and thus the current system is an accelerator system with a Xeon-D CPU.…”
Section: Introduction (mentioning)
confidence: 99%