2018
DOI: 10.1016/j.jpdc.2017.11.006
|View full text |Cite
|
Sign up to set email alerts
|

DITVA: Dynamic Inter-Thread Vectorization Architecture

Abstract: International audienceIn the Single-Program Multiple-Data (SPMD) programming model, threads of an application exhibit very similar control flows and often execute the same instructions, but on different data. In this paper, we propose the Dynamic Inter-thread Vectorization Architecture (DITVA) to leverage the implicit Data Level Parallelism that exists across threads on SPMD applications. By assembling dynamic vector instructions at runtime, DITVA extends an in-order SMT processor with a dynamic inter-thread v… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
10
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
1
1

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(10 citation statements)
references
References 33 publications
(34 reference statements)
0
10
0
Order By: Relevance
“…(2) a convergence detector to facilitate path re-convergence [Kalathingal et al 2017] ; (3) a Pathtable, which tracks path masks as the paths diverge and re-converge; (4) a Divergence Predictor for speculating warp divergences; and (5) a PIRAT for orchestrating SIMT-X renaming. SIMT-X has the capability to execute both statically vectorized SIMD instructions generated by the compiler, and instructions dynamically formed by the microarchitecture from scalar threads.…”
Section: Implementing Simt-xmentioning
confidence: 99%
See 4 more Smart Citations
“…(2) a convergence detector to facilitate path re-convergence [Kalathingal et al 2017] ; (3) a Pathtable, which tracks path masks as the paths diverge and re-converge; (4) a Divergence Predictor for speculating warp divergences; and (5) a PIRAT for orchestrating SIMT-X renaming. SIMT-X has the capability to execute both statically vectorized SIMD instructions generated by the compiler, and instructions dynamically formed by the microarchitecture from scalar threads.…”
Section: Implementing Simt-xmentioning
confidence: 99%
“…Considering general-purpose parallel applications, it is possible that MinSP-PC may cause livelocks [ElTantawy and Aamodt 2016]. Accordingly, a hybrid Round-Robin (RR) approach was proposed by Kalathingal et al [2017] to guarantee forward progress, activated periodically every (m + 1) × n cycles, where m is the number of threads/warp with a total of n warps.…”
Section: Scheduling Pathsmentioning
confidence: 99%
See 3 more Smart Citations