Proceedings 20th IEEE International Parallel &Amp; Distributed Processing Symposium 2006
DOI: 10.1109/ipdps.2006.1639349
|View full text |Cite
|
Sign up to set email alerts
|

Exploring the design space of an optimized compiler approach for mesh-like coarse-grained reconfigurable architectures

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
20
0
1

Year Published

2007
2007
2020
2020

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 11 publications
(21 citation statements)
references
References 20 publications
0
20
0
1
Order By: Relevance
“…Mainly due to lack of the program counter, the control logic in most existing CGRA templates (e.g., [5], [21], [26]) is not as versatile to fully support various conditional execution features as in ordinary microprocessors. Therefore, in order to execute a kernel loop, we transform all branches, such as if-statements, within the original loop into predicate statements [20].…”
Section: A Predicationmentioning
confidence: 99%
“…Mainly due to lack of the program counter, the control logic in most existing CGRA templates (e.g., [5], [21], [26]) is not as versatile to fully support various conditional execution features as in ordinary microprocessors. Therefore, in order to execute a kernel loop, we transform all branches, such as if-statements, within the original loop into predicate statements [20].…”
Section: A Predicationmentioning
confidence: 99%
“…That is, variables with data reuse opportunities are stored and transferred through the CGRA's interconnection network instead of using the buses. The advantage of this technique has been evaluated in [16]. Additionally, each PE consists of one Reconfigurable Functional Unit (RFU), which it can be configured to perform a specific word-level operation in every cycle.…”
Section: Cgra Architecture Descriptionmentioning
confidence: 99%
“…Also, by increasing the size of the register files, the number of variables that can be cached internally in the 2D-grid of PEs is increased. Hence, the bus is relieved from unnecessary accesses [16]. The aforementioned changes have a positive impact on the value of IPC, however, by increasing the size and number of ports of the register files, the area together with the critical path also increases leading to lower clock frequency.…”
Section: Cgra Architecture Evaluationmentioning
confidence: 99%
See 1 more Smart Citation
“…This framework consists of: 1) an existing retargetable mapping methodology [7] based on a modulo scheduling technique and 2) by a parametric CGRA architecture template that has been described in hardware description language (VHDL). The latter is used for estimating the clock frequency and the area of each considered architecture instance using the Synopsis Design Compiler in the 0.13μm process of ASIC technology.…”
Section: Introductionmentioning
confidence: 99%