2015
DOI: 10.1007/s00607-015-0444-y
|View full text |Cite
|
Sign up to set email alerts
|

ParVec: vectorizing the PARSEC benchmark suite

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

2
20
0

Year Published

2017
2017
2020
2020

Publication Types

Select...
8

Relationship

4
4

Authors

Journals

citations
Cited by 18 publications
(25 citation statements)
references
References 14 publications
2
20
0
Order By: Relevance
“…While doubling the number of cores results in approximately double the average power dissipated by the CPU, using vector units in Intel comes almost "for free" in terms of average power. Similar results have also been reported for several Intel and ARM CPUs in [3]. As a consequence, vector processing can be an attractive solution to improve energy efficiency without sacrificing performance, especially in a situation where performance trade-off is not desirable.…”
Section: Summary Of Findingssupporting
confidence: 83%
See 2 more Smart Citations
“…While doubling the number of cores results in approximately double the average power dissipated by the CPU, using vector units in Intel comes almost "for free" in terms of average power. Similar results have also been reported for several Intel and ARM CPUs in [3]. As a consequence, vector processing can be an attractive solution to improve energy efficiency without sacrificing performance, especially in a situation where performance trade-off is not desirable.…”
Section: Summary Of Findingssupporting
confidence: 83%
“…However, extracting performance from SIMD-based computations is not trivial as it can be significantly affected by irregular data access. In multi-core programming memory bandwidth often becomes a critical bottleneck, as the data movement from memory to the register bank increases with both vectorization and parallelization [3]. On the other hand, memory latency can sometimes be hidden by optimization techniques, such as SIMD prefetching or improved data locality (e.g., cache blocking).…”
Section: Vectorizationmentioning
confidence: 99%
See 1 more Smart Citation
“…Bhadauria et al [18] evaluated PARSEC benchmarks using hardware performance counters. A vectorized version of PARSEC is introduced and characterized using hardware performance counters by Cebrian et al [19]. The PARSEC paper [2] includes a hardwarecentric analysis of the benchmarks such as working set size, cache miss rates, shared data, cache traffic, and off-chip traffic, but it is based on simulations, not real machines.…”
Section: Related Workmentioning
confidence: 99%
“…We tried to fill this gap for multicore platforms by parallelizing the benchmarks of the PARSEC suite using the pattern-based approach. The PARSEC suite covers a wide range of working set size, locality patterns, data sharing, synchronizations, and memory bandwidth requirements, which have made it particularly attractive for several research works [14,23,42].…”
Section: Related Workmentioning
confidence: 99%