Proceedings of the Eighth Annual ACM Symposium on Parallel Algorithms and Architectures - SPAA '96 1996
DOI: 10.1145/237502.237508
|View full text |Cite
|
Sign up to set email alerts
|

Optimal latency-throughput tradeoffs for data parallel pipelines

Abstract: This paper addressesoptimal mapping of parallel programs composed of a chain of data parallel tasks onto the proces-

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
69
0

Year Published

1997
1997
2008
2008

Publication Types

Select...
4
4

Relationship

0
8

Authors

Journals

citations
Cited by 85 publications
(69 citation statements)
references
References 14 publications
0
69
0
Order By: Relevance
“…If the computations of a given stage are independent from one data set to another, two consecutive computations (different data sets) for the same stage can be mapped onto distinct processors, thus reducing the period for the processing of this stage. Such a stage can be replicated, using the terminology of Subhlok and Vondran [27,28] and of the DataCutter team [6,7,26]. This corresponds to the dealable stages of Cole [11].…”
Section: Working Out An Examplementioning
confidence: 99%
See 1 more Smart Citation
“…If the computations of a given stage are independent from one data set to another, two consecutive computations (different data sets) for the same stage can be mapped onto distinct processors, thus reducing the period for the processing of this stage. Such a stage can be replicated, using the terminology of Subhlok and Vondran [27,28] and of the DataCutter team [6,7,26]. This corresponds to the dealable stages of Cole [11].…”
Section: Working Out An Examplementioning
confidence: 99%
“…For example Subhlok and Vondran [27,28] have been able to design dynamic pro-gramming algorithms for bi-criteria mappings on homogeneous platforms. Also, if we neglect all communication costs, minimizing the period amounts to solve the well-known chains-to-chains problem.…”
Section: Introductionmentioning
confidence: 99%
“…Subhlok and Vondran [10] have proposed a dynamic programming solution for optimizing latency under throughput constraints for applications composed of a chain of data-parallel tasks. Benoit and Robert [11] study the theoretical complexity of latency and throughput optimization of pipeline and fork graphs with replication and data-parallelism under the assumptions of linear clustering and round-robin processing of input data items.…”
Section: Related Workmentioning
confidence: 99%
“…There are many prior works that use similar models to estimate throughput for linear chains of tasks [13] [14].…”
Section: Illustrating Examplesmentioning
confidence: 99%
“…There is a significant body of prior work in partitioning and mapping for high throughput [13] [20] have also been exploring solutions of partitioning applications for network processors. In contrast to the parallel program that we use in our system, the IXP-C compiler assumes that users develop large sequential programs.…”
Section: Related Workmentioning
confidence: 99%