2021
DOI: 10.1007/978-3-030-78713-4_7
|View full text |Cite
|
Sign up to set email alerts
|

Designing a ROCm-Aware MPI Library for AMD GPUs: Early Experiences

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
1
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 9 publications
(4 citation statements)
references
References 14 publications
0
1
0
Order By: Relevance
“…In other words, we reduce the number of inter-node communications and offload it to the intranode interconnect. Since the inter-node interconnects have significantly lower bandwidth than intra-node interconnects [5], this can improve communication performance.…”
Section: Algorithm 1: Intra-node Communication Load Matrixmentioning
confidence: 99%
See 2 more Smart Citations
“…In other words, we reduce the number of inter-node communications and offload it to the intranode interconnect. Since the inter-node interconnects have significantly lower bandwidth than intra-node interconnects [5], this can improve communication performance.…”
Section: Algorithm 1: Intra-node Communication Load Matrixmentioning
confidence: 99%
“…The broad deployment of AMD GPUs in top supercomputers emphasizes the importance of optimizing AMD GPU communications. Improvements to Inter-GPU communication bandwidth have been found to correspond to improvements in application performance [5]. The multi-GPU computing nodes in AMD platforms are equipped with the Infinity Fabric TM link.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…While NVIDIA's CUDA is a more established GPU programming framework, AMD's ROCm 3 represents a universal platform for GPU-accelerated computing. ROCm introduced new numerical formats to support common open-source machine learning libraries such as TensorFlow and PyTorch; it also provides the means for porting NVIDIA CUDA code into AMD hardware 4 . It is important to note that AMD not only is catching up to the ROCm platform in the GPU computing race, but also recently introduced the new flagship GPU architecture AMD Instinct MI200 Series 5 to compete with the latest NVIDIA Ampere A100 GPU architecture 6 .…”
mentioning
confidence: 99%