2016
DOI: 10.1016/j.micpro.2016.07.010
|View full text |Cite
|
Sign up to set email alerts
|

VThreads: A novel VLIW chip multiprocessor with hardware-assisted PThreads

Abstract: performance increase on synthetic benchmarks, x5 on a parallel Mandelbrot implementation, 66% better on a threaded JPEG implementation, 79% better on an edge-detection benchmark and~13% improvement on DES compared to the Leon3MP CMP. In the range of 2 to 8 cores VThreads demonstrates a post-route (statistical) power reduction between 65% to 57% at an area increase of 1.2%-10% for 1-8 cores, compared to a similarly-configured Leon3MP CMP. This combination of micro-architectural features, scalability, extensibil… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2016
2016
2022
2022

Publication Types

Select...
3

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 36 publications
(18 reference statements)
0
3
0
Order By: Relevance
“…This is further compounded by targeting FPGAs which are less forgiving on mux-heavy designs such as a VLIW CMP. This key observation is elaborated in the companion paper [17] and will be mitigated in the next generation of the silicon using a further HDL parameter. as a ratio of the FPGA slice utilisation; with the single context configuration requiring 5% of the slices, whereas the 8C-8B requires 37.6%.…”
Section: A Ilpmentioning
confidence: 99%
See 2 more Smart Citations
“…This is further compounded by targeting FPGAs which are less forgiving on mux-heavy designs such as a VLIW CMP. This key observation is elaborated in the companion paper [17] and will be mitigated in the next generation of the silicon using a further HDL parameter. as a ratio of the FPGA slice utilisation; with the single context configuration requiring 5% of the slices, whereas the 8C-8B requires 37.6%.…”
Section: A Ilpmentioning
confidence: 99%
“…This is achieved through our core contributions which include the instantiation on the SoC-FPGA of the LE1 CMP and the subsequent on-line compilation of OpenCL kernels by our framework targeting the LE1 silicon. We also note that the LE1 is a capable MIMD accelerator, can easily accommodate shared-memory programming models such as OpenMP and POSIX Threads (PThreads) [17] and due to the proposed source transformation/compilation flow (Section IV), it does not suffer software-incurred performance inefficiencies due to thread divergence. The authors are unaware of any current heterogeneous systems that use fully configurable general-purpose, manycore, VLIW microprocessors as OpenCL accelerators on SoC-FPGAs.…”
Section: A Motivationmentioning
confidence: 99%
See 1 more Smart Citation