2024
DOI: 10.1007/s42514-023-00159-7
|View full text |Cite
|
Sign up to set email alerts
|

swCUDA: Auto parallel code translation framework from CUDA to ATHREAD for new generation sunway supercomputer

Maoxue Yu,
Guanghao Ma,
Zhuoya Wang
et al.

Abstract: Since specific hardware characteristics and low-level programming model are adapted to both NVIDIA GPU and new generation Sunway architecture, automatically translating mature CUDA kernels to Sunway ATHREAD kernels are realistic but challenging work. To address this issue, swCUDA, an auto parallel code translation framework is proposed. To that end, we create scale affine translation to transform CUDA thread hierarchy to Sunway index, directive based memory hierarchy and data redirection optimization to assign… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 27 publications
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?