2016 49th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO) 2016
DOI: 10.1109/micro.2016.7783725
|View full text |Cite
|
Sign up to set email alerts
|

Fused-layer CNN accelerators

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
302
0
3

Year Published

2017
2017
2023
2023

Publication Types

Select...
4
4
1

Relationship

1
8

Authors

Journals

citations
Cited by 489 publications
(305 citation statements)
references
References 13 publications
0
302
0
3
Order By: Relevance
“…With much larger storage on the order of a few megabytes, additional dataflows can be considered. For example, Fused-Layer looks at dataflow optimizations across layers [96]. …”
Section: ) Energy Comparison Of Different Dataflowsmentioning
confidence: 99%
“…With much larger storage on the order of a few megabytes, additional dataflows can be considered. For example, Fused-Layer looks at dataflow optimizations across layers [96]. …”
Section: ) Energy Comparison Of Different Dataflowsmentioning
confidence: 99%
“…1,2 Accenting the fairness of the comparison, we note that the Single-CLP and Multi-CLP designs have the same arithmetic unit cost, which the Multi-CLP design spreads among several CLPs. Recall that a CLP requires T n × T m multipliers and adders.…”
Section: Detailed Comparison: Single-vs Multi-clpmentioning
confidence: 99%
“…[21] uses per-layer data quantization and matrix-decomposition, whereas [14] uses perlayer numerical precision reduction. [2] uses a fused-layer technique to reduce bandwidth use of convolutional layers. [25] optimizes batch sizes to reduce off-chip data transfer.…”
Section: Related Workmentioning
confidence: 99%
“…In the aforementioned approaches, some studies ( [7], [11], [19], [26]- [29]) have considered communication optimization, while the others mainly focus on the computational components. Besides the three categories, there are other communication optimization approaches for CNN accelerators (e.g., the fused-layer approach [35] that optimizes data movement between convolutional layers). Currently, no study has comprehensively analyzed the lower bound of communication in CNN accelerators.…”
Section: B Related Workmentioning
confidence: 99%