2014 ACM/IEEE 41st International Symposium on Computer Architecture (ISCA) 2014
DOI: 10.1109/isca.2014.6853234
|View full text |Cite
|
Sign up to set email alerts
|

Single-graph multiple flows: Energy efficient design alternative for GPGPUs

Abstract: We present the single-graph multiple-flows (SGMF) architecture that combines coarse-grain reconfigurable computing with dynamic dataflow to deliver massive thread-level parallelism. The CUDA-compatible SGMF architecture is positioned as an energy efficient design alternative for GPGPUs.The architecture maps a compute kernel, represented as a dataflow graph, onto a coarse-grain reconfigurable fabric composed of a grid of interconnected functional units. Each unit dynamically schedules instances of the same stat… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
21
0

Year Published

2015
2015
2022
2022

Publication Types

Select...
3
3

Relationship

1
5

Authors

Journals

citations
Cited by 21 publications
(22 citation statements)
references
References 38 publications
0
21
0
Order By: Relevance
“…Such accesses can be attributed to two major causes: using the memory (global or local) for interthread communication, and having multiple threads access the same memory locations. In this paper we introduce direct inter-thread communication to the previously proposed multithreaded coarse-grain reconfigurable array (MT-CGRA) [6,7]. The proposed dMT-CGRA architecture eliminates redundant memory accesses by allowing threads to directly communicate through the CGRA fabric.…”
Section: Discussionmentioning
confidence: 99%
See 3 more Smart Citations
“…Such accesses can be attributed to two major causes: using the memory (global or local) for interthread communication, and having multiple threads access the same memory locations. In this paper we introduce direct inter-thread communication to the previously proposed multithreaded coarse-grain reconfigurable array (MT-CGRA) [6,7]. The proposed dMT-CGRA architecture eliminates redundant memory accesses by allowing threads to directly communicate through the CGRA fabric.…”
Section: Discussionmentioning
confidence: 99%
“…The MT-CGRA execution model combines the static and dynamic dataflow models to execute single-instruction multiple-threads (SIMT) programs with better performance and power characteristics than von Neumann GPGPUs [6]. The model converts SIMT kernels into dataflow graphs and maps them to the CGRA fabric, where each functional unit multiplexes its operation on tokens from different instances of a dataflow graph (i.e., threads).…”
Section: Execution and Programming Modelmentioning
confidence: 99%
See 2 more Smart Citations
“…The dataflow co-processor in their system did not drive a display directly, but performed pre-processing on 3D data, a function not dissimilar to modern day vertex or geometry shaders. Voitsechov & Etsion present an alternative architecture for GPGPUs based on dataflow computing [15]. In their architecture instructions from CUDA kernels are mapped to a dataflow graph.…”
Section: Related Workmentioning
confidence: 99%