2018 51st Annual IEEE/ACM International Symposium on Microarchitecture (MICRO) 2018
DOI: 10.1109/micro.2018.00080
|View full text |Cite
|
Sign up to set email alerts
|

Morph: Flexible Acceleration for 3D CNN-Based Video Understanding

Abstract: The past several years have seen both an explosion in the use of Convolutional Neural Networks (CNNs) and the design of accelerators to make CNN inference practical. In the architecture community, the lion share of effort has targeted CNN inference for image recognition. The closely related problem of video recognition has received far less attention as an accelerator target. This is surprising, as video recognition is more computationally intensive than image recognition, and video traffic is predicted to be … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
29
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
5
2
1
1

Relationship

1
8

Authors

Journals

citations
Cited by 62 publications
(29 citation statements)
references
References 51 publications
0
29
0
Order By: Relevance
“…Morph is a exible accelerator for 3D CNN-based video processing that offered by Katrik Hegde, et al [20].…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Morph is a exible accelerator for 3D CNN-based video processing that offered by Katrik Hegde, et al [20].…”
Section: Related Workmentioning
confidence: 99%
“…Since parallel computations is an unavoidable part of CNNs, several efforts and research works have been done for designing an optimized hardware for it. As a result, many application-speci c integrated circuits (ASICs) as hardware accelerators have been introduced and evaluated in the recent decade [20]. In the next section, some of the most successful and impressive works related to CNN accelerators are introduced.…”
Section: Introductionmentioning
confidence: 99%
“…We primarily consider loop tiling, which is known to be critical to exploiting data reuse in DNNs [52,74]. Prior work in DNN tiling predominately searches for the tiling strategy in a brute-force manner [32,46]. However, brute-force search does not scale to stereo DNNs for two reasons.…”
Section: Exploiting Inter-layer Activation Reusementioning
confidence: 99%
“…Although they are specialized, accelerators are often flexible [21,36,52,59,95], designed to support different parameterizations of a single algorithm to even a range of algorithms within or across domains. This flexibility forces architects to decouple the act of designing the architecture from the act of mapping a specific problem-a parameterized instance of an algorithm-onto the architecture.…”
Section: Introductionmentioning
confidence: 99%