2018
DOI: 10.1109/mm.2018.032271058
|View full text |Cite
|
Sign up to set email alerts
|

Plasticine: A Reconfigurable Accelerator for Parallel Patterns

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

1
104
0
1

Year Published

2018
2018
2022
2022

Publication Types

Select...
4
2
1

Relationship

1
6

Authors

Journals

citations
Cited by 66 publications
(106 citation statements)
references
References 12 publications
1
104
0
1
Order By: Relevance
“…Specialized hardware can accelerate polystore systems by leveraging multicore CPUs, graphics processing unit (GPUs [28]), field programmable gate arrays (FPGAs [29]), application-specific integrated chips (ASICs [30]), or coarse grain reconfigurable arrays (CGRAs [31]) such as Plasticine [32]. Multicore CPUs consume more power per task with limited parallelism and inefficient data movement compared to well-matched applications running on accelerators.…”
Section: B Hardware Acceleratorsmentioning
confidence: 99%
See 1 more Smart Citation
“…Specialized hardware can accelerate polystore systems by leveraging multicore CPUs, graphics processing unit (GPUs [28]), field programmable gate arrays (FPGAs [29]), application-specific integrated chips (ASICs [30]), or coarse grain reconfigurable arrays (CGRAs [31]) such as Plasticine [32]. Multicore CPUs consume more power per task with limited parallelism and inefficient data movement compared to well-matched applications running on accelerators.…”
Section: B Hardware Acceleratorsmentioning
confidence: 99%
“…Each accelerator is programmed using its hardware-specific low-level language (e.g., Verilog), which requires a developer to have deep understanding of the underlying hardware. Application and hardware domain-specific languages (DSLs), such as Spatial [38], Relay [39], and Delite [40], ease application development for specific accelerators by abstracting low-level abstractions into high-level primitives (e.g., parallel patterns [32]) that a developer is familiar with. For example, Halide [41] and Tensorflow [42] are application-specific DSLs for image processing and deep neural network processing respectively.…”
Section: B Hardware Acceleratorsmentioning
confidence: 99%
“…Designs range from architectures with simple general-purpose cores with (software) configurable interconnect, such as MIT's RAW [40], to more proper CGRAs that tightly integrate a general-purpose core with an array of functional units (typically identical arithmetic-logic units), such as GARP [8], Piperench [22], ADRES [27], Tartan [31], and DySER [23]. The Plasticine [39] spatially reconfigurable design combines pattern compute units (PCUs), hierarchically composed of a reconfigurable pipeline with multiple stages of SIMD functional units, and pattern memory units (PMUs), simplifying mapping of inner loops and feedback edges to the hardware and enabling execution of applications expressed as parallel patterns. Ongoing efforts in Path Forward projects and the DARPA Electronic Resurgence Initiative (ERI) demonstrate the intense interest around CGRAs.…”
Section: Related Workmentioning
confidence: 99%
“…One solution is to design a reconfigurable architecture at word level, the Coarse‐Grained Reconfigurable Architectures (CGRAs). However, there is no commercial CGRA available in the market, and the CGRAs also suffer from long compilation times …”
Section: Related Workmentioning
confidence: 99%
“…However, there is no commercial CGRA available in the market, and the CGRAs also suffer from long compilation times. 19 One way is to focus on application and domain-specific accelerators: Neural network, 2,20 Bayesian learning, 21 bioinformatics, 22 stencil computing, 16,23 energy-efficient accelerators for graph analytics algorithms, 24 and irregular applications mapping. 25 Another way is to focus on Domain-Specific Language (DSL) which aims representing parallelism in stream-based applications, like SPar based on C++.…”
mentioning
confidence: 99%