2012 IEEE 26th International Parallel and Distributed Processing Symposium 2012
DOI: 10.1109/ipdps.2012.125
|View full text |Cite
|
Sign up to set email alerts
|

An SMT-Selection Metric to Improve Multithreaded Applications' Performance

Abstract: Abstract-Simultaneous multithreading (SMT) increases CPU utilization and application performance in many circumstances, but it can be detrimental when performance is limited by application scalability or when there is significant contention for CPU resources. This paper describes an SMT-selection metric that predicts the change in application performance when the SMT level and number of application threads are varied. This metric is obtained online through hardware performance counters with little overhead, an… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2014
2014
2019
2019

Publication Types

Select...
2
2
1

Relationship

1
4

Authors

Journals

citations
Cited by 7 publications
(2 citation statements)
references
References 17 publications
0
2
0
Order By: Relevance
“…A different approach is followed by Saez et al [25], who propose a non-work-conserving scheduler that greatly speedups critical threads while still achieving slight throughput improvements on ST and SMT multicores. Also targeting SMT multicores but focused on multithreaded applications, Funston et al [26] propose an SMT metric to select the optimal number of threads per core depending on the instruction mix of the application.…”
Section: Related Workmentioning
confidence: 99%
“…A different approach is followed by Saez et al [25], who propose a non-work-conserving scheduler that greatly speedups critical threads while still achieving slight throughput improvements on ST and SMT multicores. Also targeting SMT multicores but focused on multithreaded applications, Funston et al [26] propose an SMT metric to select the optimal number of threads per core depending on the instruction mix of the application.…”
Section: Related Workmentioning
confidence: 99%
“…There were algorithms that scheduled threads so as to minimize contention for shared caches, memory controllers and multithreaded CPU pipelines [8,9,24,29,34,42,46]. There were algorithms that reduced communication distance among threads sharing data [41] and determined the optimal number of cores to allocate to multithreaded workloads [17]. There were algorithms that addressed the scheduling of threads on asymmetric multicore CPUs [22,35] and algorithms that integrated scheduling with the management of power and temperature [19].…”
Section: Lessons Learnedmentioning
confidence: 99%