2018
DOI: 10.1109/mm.2018.2877289
|View full text |Cite
|
Sign up to set email alerts
|

A Communication-Centric Approach for Designing Flexible DNN Accelerators

Abstract: High computational demands of deep neural networks (DNNs) coupled with their pervasiveness across cloud and IoT platforms have led to the emergence of DNN accelerators employing hundreds of processing elements (PE). Most DNN accelerators are optimized for regular mapping of the problems, or dataflows, emanating from dense matrix multiplications in convolutional layers. However, continuous innovations in DNN including myriad layer types/shapes, cross-layer fusion, and sparsity have led to irregular dataflows wi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
10
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 14 publications
(12 citation statements)
references
References 13 publications
0
10
0
Order By: Relevance
“…For this reason, the third pillar of our envisaged architecture is taking a communication-centric design approach. This is a philosophy that has been applied to endow DNN accelerators with certain flexibility [33], [177], [178] or to optimize distributed learning [179]. In our case, we propose the use of a reconfigurable interconnect fabric among the PEs to adapt the hardware to the underlying graph connectivity or, in other words, to the optimal dataflow that may vary across layers, partitions, or graphs.…”
Section: Communication-centric Designmentioning
confidence: 99%
“…For this reason, the third pillar of our envisaged architecture is taking a communication-centric design approach. This is a philosophy that has been applied to endow DNN accelerators with certain flexibility [33], [177], [178] or to optimize distributed learning [179]. In our case, we propose the use of a reconfigurable interconnect fabric among the PEs to adapt the hardware to the underlying graph connectivity or, in other words, to the optimal dataflow that may vary across layers, partitions, or graphs.…”
Section: Communication-centric Designmentioning
confidence: 99%
“…General dataflow for a CNN accelerator can be categorized into four types: Weight Stationary (WS), Input Stationary (IS), Output Stationary (OS), and Row Stationary (RS), based on the taxonomy and terminology proposed in [42]. Kwon et al propose MEARI, i.e., a type of flexible dataflow mapping for DNN with reconfigurable interconnects [25,27]. Chen et al present a row-stationary dataflow adapting to their spatial architecture designed for CNNs processing [9].…”
Section: Dataflowmentioning
confidence: 99%
“…MAERI [67], [69] is a DNN accelerator design with a set of configurable building blocks consisting of multiply and adder engines that can be configured with tiny switches. This new modular design can support various dataflows of arbitrary DNNs and map them successfully on accelerator elements.…”
Section: Reconfigurable Interconnectsmentioning
confidence: 99%