Abstract:The coarse-grained reconfigurable architecture (CGRA) is a promising platform that provides both high performance and high power-efficiency. The compute-intensive portions of an application (e.g. loops) are often mapped onto CGRA for acceleration. To optimize the mapping of loop nests to CGRA, this paper makes two contributions: i) Establishing a precise CGRA performance model and formulating the loop nests mapping as a nonlinear optimization problem based on polyhedral model, ii) Extracting an efficient heuri… Show more
“…First, the loop nest is unrolled and an iteration domain is derived from the bounds of loops [9], where each node denotes an iteration (loop body); next, the iteration domain is tiled into small loop tiling blocks; finally each block is mapped onto PEA in the left-to-right and top-to-bottom order. Here we use the PEA resource tile (PRT) defined in [6] to describe the mapping relationship between PEA and the iteration domain. The PRT indicates the maximal size of iterations a PEA could hold, and each PRT refers to a PEA operation of CGRA.…”
Section: A General Loop Mapping Methodsmentioning
confidence: 99%
“…When all the durations of every PEA operation are determined, we will obtain the start time of each PEA operation, denoted as t p . According to [6], each duration is determined by the form of loop nests mapping, so finding a suitable mapping method is the objective.…”
Section: The Distribution Of Currentsmentioning
confidence: 99%
“…4 (b). Like the work in [6], we define Regular PRT (R-PRT) as PRT full filled with iterations, and define Irregular PRT (I-PRT) as PRT partial filled with iterations. Figure 4 (c) uses the blue tiles and the yellow tiles to represent R-PRTs and I-PRTs respectively.…”
Section: (C) When the Polyhedral Parametersmentioning
confidence: 99%
“…In [5], a method called EPIMap is proposed to obtain optimal mapping scheme, which uses recomputation for resource limitation and obtained optimized II for a single-level loop and is based on modulo scheduling. In [6], the proposed approach called PolyMap uses polyhedral model to do mapping optimization of nested loop, where only the total execution time of CGRA is taken as an optimization metric and regardless of energy consumption. In [7], the active area of FPGA is considered as a key…”
SUMMARYCoarse-grained Reconfigurable Architecture (CGRA) is a promising mobile computing platform that provides both high performance and high energy efficiency. In an application, loop nests are usually mapped onto CGRA for further acceleration, so optimizing the mapping is an important goal for design of CGRAs. Moreover, obviously almost all of mobile devices are powered by batteries, how to reduce energy consumption also becomes one of primary concerns in using CGRAs. This paper makes three contributions: a) Proposing an energy consumption model for CGRA; b) Formulating loop nests mapping problem to minimize the battery charge loss; c) Extract an efficient heuristic algorithm called BPMap. Experiment results on most kernels of the benchmarks and real-life applications show that our methods can improve the performance of the kernels and lower the energy consumption.
“…First, the loop nest is unrolled and an iteration domain is derived from the bounds of loops [9], where each node denotes an iteration (loop body); next, the iteration domain is tiled into small loop tiling blocks; finally each block is mapped onto PEA in the left-to-right and top-to-bottom order. Here we use the PEA resource tile (PRT) defined in [6] to describe the mapping relationship between PEA and the iteration domain. The PRT indicates the maximal size of iterations a PEA could hold, and each PRT refers to a PEA operation of CGRA.…”
Section: A General Loop Mapping Methodsmentioning
confidence: 99%
“…When all the durations of every PEA operation are determined, we will obtain the start time of each PEA operation, denoted as t p . According to [6], each duration is determined by the form of loop nests mapping, so finding a suitable mapping method is the objective.…”
Section: The Distribution Of Currentsmentioning
confidence: 99%
“…4 (b). Like the work in [6], we define Regular PRT (R-PRT) as PRT full filled with iterations, and define Irregular PRT (I-PRT) as PRT partial filled with iterations. Figure 4 (c) uses the blue tiles and the yellow tiles to represent R-PRTs and I-PRTs respectively.…”
Section: (C) When the Polyhedral Parametersmentioning
confidence: 99%
“…In [5], a method called EPIMap is proposed to obtain optimal mapping scheme, which uses recomputation for resource limitation and obtained optimized II for a single-level loop and is based on modulo scheduling. In [6], the proposed approach called PolyMap uses polyhedral model to do mapping optimization of nested loop, where only the total execution time of CGRA is taken as an optimization metric and regardless of energy consumption. In [7], the active area of FPGA is considered as a key…”
SUMMARYCoarse-grained Reconfigurable Architecture (CGRA) is a promising mobile computing platform that provides both high performance and high energy efficiency. In an application, loop nests are usually mapped onto CGRA for further acceleration, so optimizing the mapping is an important goal for design of CGRAs. Moreover, obviously almost all of mobile devices are powered by batteries, how to reduce energy consumption also becomes one of primary concerns in using CGRAs. This paper makes three contributions: a) Proposing an energy consumption model for CGRA; b) Formulating loop nests mapping problem to minimize the battery charge loss; c) Extract an efficient heuristic algorithm called BPMap. Experiment results on most kernels of the benchmarks and real-life applications show that our methods can improve the performance of the kernels and lower the energy consumption.
“…Besides the hardware structure of CGRA, the configuration process plays an increasingly important role in terms of improving performance and reducing power consumption [10,11,12]. The proposed CGRA features dynamic configuration, where no more configuration contexts are updated during its computation.…”
This paper proposes a novel coarse-grained reconfigurable array (CGRA) with hierarchical context cache structure and efficient cache management approaches, including time-frequency weighted (TFW) context cache replacement strategy and context multi-casting (CMC) mechanism. By fully exploiting inherent configuration features, the configuration performance is improved by 18.2% with half context memory cost. Our CGRA was implemented under the process of TSMC 65 nm, which can work at the frequency of 200 MHz with the area of 23.2 mm 2 . Compared to the previous CGRAs, our work has the advantage of 3.8∼12× performance improvement and 2.3∼15.7× energy efficiency increase.
With the increasing demand for high performance computing in application domains with stringent power budgets, coarse-grained reconfigurable array (CGRA) architectures have become a popular choice among researchers and manufacturers. Loops are the hot-spots of kernels running on CGRAs and hence several techniques have been devised to optimize the loop execution. However, works in this direction are predominantly software-based solutions. This paper addresses the optimization opportunities at a deeper level and introduces a hardware based loop control mechanism that can support arbitrarily nested loops up to four levels. Major contributions of this work are, a lightweight Hardware Loop Block (HLB) for CGRAs that eliminates control instruction overhead of loops and an acyclic graph transformation that removes loop branches from the application CDFG. When tested on a set of kernels chosen from various application domains, the design could achieve a maximum of 1.9× and an average of 1.5× speed-up against the conventional approach. The total number of instructions executed is reduced to half for almost all the kernels with an area and power consumption overhead of 2.6% and 0.8% respectively.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.