Proceedings of the 2016 Workshop on Rapid Simulation and Performance Evaluation: Methods and Tools 2016
DOI: 10.1145/2852339.2852342
|View full text |Cite
|
Sign up to set email alerts
|

Performance estimation of streaming applications for hierarchical MPSoCs

Abstract: Parallel programming and effective partitioning of applications for embedded many-core architectures requires optimization algorithms. However, these algorithms have to quickly evaluate thousands of different partitions. We present a fast performance estimator embedded in a parallelizing compiler for streaming applications. The estimator combines a single execution-based simulation and an analytic approach. Experimental results demonstrate that the estimator has a mean error of 2.6% and computes its estimation… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
6
0

Year Published

2016
2016
2022
2022

Publication Types

Select...
3
2
2

Relationship

1
6

Authors

Journals

citations
Cited by 9 publications
(6 citation statements)
references
References 14 publications
(20 reference statements)
0
6
0
Order By: Relevance
“…Considering the calculation and communication cost of directed acyclic graph (DAG), Tang et al [8] presented two heuristic strategies based on integer linear programming to reduce communication overhead and scheduling length. Flasskamp et al [9] designed a performance estimator embedded in the compiler to partition and map streaming applications. For an MPSoC system consisting of a multi-core CPU and on-chip GPU, Vilches et al [10] introduced a novel framework that can adaptively find the best mapping for multiple tasks to achieve better performance.…”
Section: Related Workmentioning
confidence: 99%
“…Considering the calculation and communication cost of directed acyclic graph (DAG), Tang et al [8] presented two heuristic strategies based on integer linear programming to reduce communication overhead and scheduling length. Flasskamp et al [9] designed a performance estimator embedded in the compiler to partition and map streaming applications. For an MPSoC system consisting of a multi-core CPU and on-chip GPU, Vilches et al [10] introduced a novel framework that can adaptively find the best mapping for multiple tasks to achieve better performance.…”
Section: Related Workmentioning
confidence: 99%
“…Authors in [11,15,16] propose hybrid methods: first simulation is used to obtain the execution time of each procedure on each type of processing element, then analytical methods are used to account for cache and communication effects.…”
Section: Related Workmentioning
confidence: 99%
“…However, programming a complex MPSoC and making effective use of its resources is a challenging task for the programmer. For this reason we have developed a parallelizing compiler for streaming applications to assist in programming the CoreVA-MPSoC [6], [28]. This compiler, called CoreVA-MPSoC compiler, processes applications written in the StreamIt language [29].…”
Section: E Compiler Infrastructurementioning
confidence: 99%
“…A simple analytical model might be evaluated faster compared to a post place and route simulation but achieves a lower accuracy. Our approach is called Simulation Based Estimation (SBE) and combines a single execution-based simulation and an analytic approach [28]. The execution time of each tasks is measured only once and independent of MPSoC configuration and task placement, because all CPUs in our MPSoC have the same properties like, e.g., clock frequency and number of VLIW slots (cf.…”
Section: E Compiler Infrastructurementioning
confidence: 99%
See 1 more Smart Citation