2019
DOI: 10.1007/978-3-030-31756-0_10
|View full text |Cite
|
Sign up to set email alerts
|

Heterogeneous Computing System for Deep Learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
1
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 5 publications
0
1
0
Order By: Relevance
“…As a comparison, in [28], the FPGAs were reported to provide up to 28.7 higher performance per watt than the corresponding GPU implementations for several LSTM models. Moreover, as stated in [29], the actual achieved performance in a GPU, for LSTM networks, ranges from 17.85% to 25% of the peak reported performance of the GPU; thus, for an NVIDIA Tesla K80 GPU, this translates from 3.33 to 4.66 GFLOPS/W of achieved performance. As a result, the created system, which triggers 5.29 GFLOPS/W, can outperform such a GPU.…”
Section: Performance Over Power Efficiencymentioning
confidence: 96%
“…As a comparison, in [28], the FPGAs were reported to provide up to 28.7 higher performance per watt than the corresponding GPU implementations for several LSTM models. Moreover, as stated in [29], the actual achieved performance in a GPU, for LSTM networks, ranges from 17.85% to 25% of the peak reported performance of the GPU; thus, for an NVIDIA Tesla K80 GPU, this translates from 3.33 to 4.66 GFLOPS/W of achieved performance. As a result, the created system, which triggers 5.29 GFLOPS/W, can outperform such a GPU.…”
Section: Performance Over Power Efficiencymentioning
confidence: 96%
“…The architecture of the lowest level in the proposed hierarchy is defined by the data structure deployed in the local memories mem0 in the MAP array, the instructions executed in each cell by eng0, and the functions performed in the logdepth networks REDUCE and SCAN (see [13]). Shortly, the architecture can be defined as follows:…”
Section: The Abstract Modelmentioning
confidence: 99%