Proceedings of the 11th ACM Conference on Computing Frontiers 2014
DOI: 10.1145/2597917.2597932
|View full text |Cite
|
Sign up to set email alerts
|

Embracing heterogeneity with dynamic core boosting

Abstract: Uniformly distributing parallel workloads amongst threads is an effective strategy for programmers to increase application performance. However, in any parallel segment, execution time is determined by the longest running thread. Even for embarrassingly parallel programs in the form of SPMD (single program multiple data), the threads are not perfectly balanced due to control flow divergence, non-deterministic memory latencies, and synchronization operations. Such an imbalance can be significantly exacerbated b… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
3
0

Year Published

2016
2016
2017
2017

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(4 citation statements)
references
References 31 publications
1
3
0
Order By: Relevance
“…The total number of iterations (the variable ntimes, passed to tb total) can be different for different threads and from one barrier-interval to the next. This style of software annotation is similar to previous work [10]- [12]. Instrumentation is a one-time effort.…”
Section: A Application-level Thread Progress Informationsupporting
confidence: 59%
See 2 more Smart Citations
“…The total number of iterations (the variable ntimes, passed to tb total) can be different for different threads and from one barrier-interval to the next. This style of software annotation is similar to previous work [10]- [12]. Instrumentation is a one-time effort.…”
Section: A Application-level Thread Progress Informationsupporting
confidence: 59%
“…Meeting Points [10] is an early work that uses application-level thread progress reporting annotation, but is only applicable to parallel loops with balanced workloads. Dynamic Core Boosting [12] is a recent work that uses application-level thread progress reporting annotation that accommodates imbalanced workloads, like ThreadBeats. However, the focus of that work is on accelerating cores in asymmetric chip multiprocessors using coarse-grained boosting techniques, whereas ThreadBeats is focused on tuning finegrained per-core DVFS in symmetric chip multiprocessors.…”
Section: Related Work Instruction Count Metricsmentioning
confidence: 99%
See 1 more Smart Citation
“…However, it only works for balanced programs, in which all threads run the same loop count with similar amount of computation. DCB [40] extends the model to unbalanced loops and to threads with different code in pipeline parallel programs, using an epoch-based approach in the latter case. However, it requires a significant profiling and instrumentation of the application.…”
Section: B Criticality Estimation and Exploitationmentioning
confidence: 99%