2019
DOI: 10.1002/cpe.5574
Efficient MPI‐AllReduce for large‐scale deep learning on GPU‐clusters

Abstract: Training models on large-scale GPU-accelerated clusters is becoming commonplace due to the increase in complexity and size of Deep Learning models. One of the main challenges for distributed training is the collective communication overhead for large message sizes: up to hundreds of MB. In this paper, we propose two hierarchical distributed memory multi-leader allreduce algorithms optimized for GPU-accelerated clusters (named lr_lr and lr_rab), in which GPUs inside a computing node perform an intra-node c…
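The abstract describes a two-level (intra-node, then inter-node) allreduce structure. The sketch below is not the paper's lr_lr or lr_rab algorithm; it is a minimal C/MPI illustration of the generic hierarchical pattern such algorithms build on: reduce onto a per-node leader, allreduce among the leaders, then broadcast back within each node. The function name `hierarchical_allreduce` and the choice of a float sum are assumptions for illustration only.

```c
/* Minimal sketch of a two-level (hierarchical) allreduce with one leader
 * rank per node. NOT the paper's lr_lr/lr_rab algorithms, only the generic
 * intra-node + inter-node pattern they build on. */
#include <mpi.h>
#include <stdlib.h>

void hierarchical_allreduce(float *buf, int count, MPI_Comm comm)
{
    int world_rank;
    MPI_Comm_rank(comm, &world_rank);

    /* Group ranks that share a node into one communicator. */
    MPI_Comm node_comm;
    MPI_Comm_split_type(comm, MPI_COMM_TYPE_SHARED, world_rank,
                        MPI_INFO_NULL, &node_comm);

    int node_rank;
    MPI_Comm_rank(node_comm, &node_rank);

    /* Leaders (node_rank == 0) form their own inter-node communicator. */
    MPI_Comm leader_comm;
    MPI_Comm_split(comm, node_rank == 0 ? 0 : MPI_UNDEFINED,
                   world_rank, &leader_comm);

    /* Step 1: intra-node reduction onto the node leader. */
    float *tmp = NULL;
    if (node_rank == 0) tmp = malloc(sizeof(float) * count);
    MPI_Reduce(buf, tmp, count, MPI_FLOAT, MPI_SUM, 0, node_comm);

    /* Step 2: inter-node allreduce among the leaders only. */
    if (node_rank == 0) {
        MPI_Allreduce(MPI_IN_PLACE, tmp, count, MPI_FLOAT, MPI_SUM,
                      leader_comm);
        MPI_Comm_free(&leader_comm);
    }

    /* Step 3: broadcast the result from the leader to all ranks on the node. */
    if (node_rank == 0) {
        MPI_Bcast(tmp, count, MPI_FLOAT, 0, node_comm);
        for (int i = 0; i < count; i++) buf[i] = tmp[i];
        free(tmp);
    } else {
        MPI_Bcast(buf, count, MPI_FLOAT, 0, node_comm);
    }

    MPI_Comm_free(&node_comm);
}
```

The paper's multi-leader variants presumably generalize this single-leader pattern by involving several GPUs per node in the inter-node step; the single-leader version above is only the simplest instance of the hierarchy.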

Cited by 6 publications (1 citation statement)
References 40 publications (106 reference statements)
“…MPI provides a powerful set of communication and synchronization mechanisms, enabling efficient communication and collaboration in parallel programs. Among these mechanisms, the Allreduce operation in MPI holds significant importance [2] . It is used for data reduction among multiple processes and is commonly employed for calculations such as summation and finding the maximum value.…”
Section: Introduction
confidence: 99%
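For context, the Allreduce operation referenced in the citation statement combines values from all ranks and leaves the result on every rank. Below is a minimal, self-contained C/MPI example (with hypothetical per-rank values) showing the summation and maximum reductions mentioned above.

```c
/* Minimal MPI_Allreduce example: sum and max of one double per rank,
 * result available on every rank. Values are hypothetical. */
#include <mpi.h>
#include <stdio.h>

int main(int argc, char **argv)
{
    MPI_Init(&argc, &argv);

    int rank;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);

    double local = (double)(rank + 1);   /* each rank contributes rank + 1 */
    double sum = 0.0, max = 0.0;

    /* Sum of all contributions. */
    MPI_Allreduce(&local, &sum, 1, MPI_DOUBLE, MPI_SUM, MPI_COMM_WORLD);
    /* Maximum across ranks. */
    MPI_Allreduce(&local, &max, 1, MPI_DOUBLE, MPI_MAX, MPI_COMM_WORLD);

    printf("rank %d: sum = %.1f, max = %.1f\n", rank, sum, max);

    MPI_Finalize();
    return 0;
}
```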