2019
DOI: 10.1145/3358198
|View full text |Cite
|
Sign up to set email alerts
|

dMazeRunner

Abstract: Dataflow accelerators feature simplicity, programmability, and energy-efficiency and are visualized as a promising architecture for accelerating perfectly nested loops that dominate several important applications, including image and media processing and deep learning. Although numerous accelerator designs are being proposed, how to discover the most efficient way to execute the perfectly nested loop of an application onto computational and memory resources of a given dataflow accelerator ( executi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
2
1

Relationship

1
6

Authors

Journals

citations
Cited by 61 publications
(3 citation statements)
references
References 38 publications
0
3
0
Order By: Relevance
“…In [10], accuracy ranges from 92% to 99% when estimating energy consumption. Also, studies [13], [14], [15], show that by using analytical estimation, they can find better optimized DL accelerators fast and accurately.…”
Section: B Exploration Of Different Accelerator Designsmentioning
confidence: 99%
“…In [10], accuracy ranges from 92% to 99% when estimating energy consumption. Also, studies [13], [14], [15], show that by using analytical estimation, they can find better optimized DL accelerators fast and accurately.…”
Section: B Exploration Of Different Accelerator Designsmentioning
confidence: 99%
“…Automating Comprehensive Mapping Space Formulation: Mapping space for an NPU encapsulates all schedules (aka iteration spaces in a polyhedral compiler [49], [50]) that are possible corresponding to various loop optimizations like tiling, ordering, and unrolling, when executing a nested loop on an NPU [4], [5], [37]. To develop a compiler for a customized NPU architecture, experts have previously formulated the mapping space manually [1], [4], [34] or relied on NPU-agnostic loop optimizations [39]. Then, compiler mapped operations for a schedule by software pipelining [43], [51].…”
Section: B End-to-end Agile Design Workflowmentioning
confidence: 99%
“…An architectural template for an NPU specifies what kinds of computational and memory units can be interconnected and how. Various system stack tools for the NPU, such as cost models, simulators, and compilers, are developed manually by experts, limiting support to only the template architecture [1], [3], [4]. As workloads evolve or application requirements become stringent, novel architectural features need to be integrated and explored [5].…”
Section: Introductionmentioning
confidence: 99%