2018
DOI: 10.1109/tcad.2018.2857098
|View full text |Cite
|
Sign up to set email alerts
|

Heterogeneous FPGA-Based Cost-Optimal Design for Timing-Constrained CNNs

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
23
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
5
2
1

Relationship

2
6

Authors

Journals

citations
Cited by 37 publications
(23 citation statements)
references
References 18 publications
0
23
0
Order By: Relevance
“…To fully utilize the computation power provided by multiple FPGAs, a typical technique is to implement the neural network on multiple FPGAs in a pipelined fashion [15], [20], [22], [23]. Figure 2 demonstrates one such example, in which a 5-layer network is partitioned into 3 pipeline stages, and each pipeline stage is mapped to a certain FPGA in an available pool.…”
Section: B Implementing Dnns On Fpgasmentioning
confidence: 99%
See 1 more Smart Citation
“…To fully utilize the computation power provided by multiple FPGAs, a typical technique is to implement the neural network on multiple FPGAs in a pipelined fashion [15], [20], [22], [23]. Figure 2 demonstrates one such example, in which a 5-layer network is partitioned into 3 pipeline stages, and each pipeline stage is mapped to a certain FPGA in an available pool.…”
Section: B Implementing Dnns On Fpgasmentioning
confidence: 99%
“…Our co-exploration concept and the general framework, however, can also be easily extended to other hardware platforms such as ASICs. Since timing performance on a single FPGA is limited by its restricted resource, it is prevalent to organize multiple FPGAs in a pipelined fashion [20]- [23] to provide high throughput (frame per second, FPS). In such a system, the pipeline efficiency is one of the most important metrics needing to be maximized, since it determines the hardware utilization as well as energy efficiency.…”
Section: Introductionmentioning
confidence: 99%
“…In [16], the authors focus on designing optimal pipelined CNNs on a set of heterogeneous FPGAs. The rationale is that different tasks in the pipeline are better suited to a specific type of FPGA.…”
Section: Related Workmentioning
confidence: 99%
“…Most recently, with the growing demand in time performance, it is a trend to employ a cluster of FPGAs to execute DNNs [15,[26][27][28][29][30][31][32]. In [15,28], authors construct multiple FPGAs as a pipeline to execute a set of input images in a pipeline fashion.…”
Section: Related Workmentioning
confidence: 99%
“…In [15,28], authors construct multiple FPGAs as a pipeline to execute a set of input images in a pipeline fashion. In [26], authors split the CNN layers to balance pipeline stages for higher throughput and lower cost. Authors in [27] employ multiple FPGAs for the training phase.…”
Section: Related Workmentioning
confidence: 99%