2019
DOI: 10.1002/cpe.5563
|View full text |Cite
|
Sign up to set email alerts
|

Scheduling data streams for low latency and high throughput on a Cray XC40 using Libfabric

Abstract: Summary Achieving efficient many‐to‐many communication on a given network topology is a challenging task when many data streams from different sources have to be scattered concurrently to many destinations with low variance in arrival times. In such scenarios, it is critical to saturate but not to congest the bisectional bandwidth of the network topology in order to achieve a good aggregate throughput. When there are many concurrent point‐to‐point connections, the communication pattern needs to be dynamically … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
3

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 32 publications
0
2
0
Order By: Relevance
“…Unfortunately, using deep learning models for prediction introduces additional overhead. In order to achieve fair network usage and reduce latency, the scheduling method proposed by Salem et al 38 runs on a group of senders and receivers, and channels the distribution of large‐capacity data streams with high throughput, so that streams from the same observation time are quickly gathered in the compute nodes.…”
Section: Related Workmentioning
confidence: 99%
“…Unfortunately, using deep learning models for prediction introduces additional overhead. In order to achieve fair network usage and reduce latency, the scheduling method proposed by Salem et al 38 runs on a group of senders and receivers, and channels the distribution of large‐capacity data streams with high throughput, so that streams from the same observation time are quickly gathered in the compute nodes.…”
Section: Related Workmentioning
confidence: 99%
“…Motivated by a use case from the Compressed Baryonic Matter (CBM) experiment, Salem et al address the problem of achieving efficient many‐to‐many communication for a given network topology when multiple data streams from different sources need to be scattered to multiple destinations concurrently and with a low variance in arrival times. They present a nonblocking, distributed data flow scheduler that is built on libfabric and is thus compatible with many modern interconnects.…”
Section: Themes Of This Special Issuementioning
confidence: 99%