2004 IEEE International Conference on Cluster Computing (IEEE Cat. No.04EX935)
DOI: 10.1109/clustr.2004.1392611
|View full text |Cite
|
Sign up to set email alerts
|

Efficient Barrier and Allreduce on Infiniband clusters using multicast and adaptive algorithms

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
18
0

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 28 publications
(18 citation statements)
references
References 3 publications
0
18
0
Order By: Relevance
“…The butterfly-like algorithm has been developed some times ago [22,27] and has been extended to handle non-power-of-two numbers of processes [23]. Various architecture specific all-reduce schemes have also been developed [1,4,12,17,26]. An all-reduce algorithm was designed for BlueGene/L systems in [1].…”
Section: Ethernet Switched Cluster Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…The butterfly-like algorithm has been developed some times ago [22,27] and has been extended to handle non-power-of-two numbers of processes [23]. Various architecture specific all-reduce schemes have also been developed [1,4,12,17,26]. An all-reduce algorithm was designed for BlueGene/L systems in [1].…”
Section: Ethernet Switched Cluster Resultsmentioning
confidence: 99%
“…In [12], an all-reduce scheme that takes advantage of remote DMA (RDMA) capability was developed for VIA-based clusters. The work in [17] investigated an adaptive all-reduce algorithm in an InfiniBand cluster that deals with the situation when not all nodes arrive at the call site at the same time. A study on the all-reduce operation over WAN can be found in [4].…”
Section: Ethernet Switched Cluster Resultsmentioning
confidence: 99%
“…Example algorithms include the all-reduce and barrier algorithms in [17] and the broadcast and reduce algorithms in [28]. Our work advocates further development of such algorithms as well as other mechanisms to handle the imbalanced process arrival pattern problem.…”
Section: Related Workmentioning
confidence: 99%
“…Our proposed scheme combines this idea with contention-free realization of reduce-scatter and all-gather operations. Various architecture specific allreduce schemes have also been developed [1,3,5,7,17]. In particular, all-reduce algorithms were developed specifically for SMP clusters in [17].…”
Section: Related Workmentioning
confidence: 99%