Proceedings of the 40th Annual International Symposium on Computer Architecture 2013
DOI: 10.1145/2485922.2485925

Convolution engine

Abstract: This paper focuses on the trade-off between flexibility and efficiency in specialized computing. We observe that specialized units achieve most of their efficiency gains by tuning data storage and compute structures and their connectivity to the data-flow and data-locality patterns in the kernels. Hence, by identifying key data-flow patterns used in a domain, we can create efficient engines that can be programmed and reused across a wide range of applications. We present an example, the Convolution Engine (CE), …

Cited by 132 publications (1 citation statement)
References 26 publications
“…Designing an efficient hardware architecture for PCANet, achieving 27.4 fps throughput at 1080P, while consuming only 0.5 watt. Our implementation beats typical CNN implementations such as ConvEngine [8] in both power efficiency and throughput. 3.…”
Section: Introduction (citation type: mentioning)
confidence: 90%