2015 IEEE Hot Chips 27 Symposium (HCS) 2015
DOI: 10.1109/hotchips.2015.7477457
|View full text |Cite
|
Sign up to set email alerts
|

UltraScale+ MPSoC and FPGA families

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
17
0

Year Published

2017
2017
2024
2024

Publication Types

Select...
6
3

Relationship

0
9

Authors

Journals

citations
Cited by 34 publications
(20 citation statements)
references
References 0 publications
0
17
0
Order By: Relevance
“…We implemented passthrough kernels which copy pixels from one memory location to another without applying any arithmetic/logical operations. In the FPGA implementation, Xilinx's SDx tool instantiates data movers [36] for each input or output port to transfer data between the memory mapped domain and the stream domain. Table 2 shows that FPGA takes 6.945 ms to copy an entire image (1080p) with 0.41 mJ/frame, while GPU takes 1.298 ms with 0.19 mJ/frame.…”
Section: Benchmarking Approachmentioning
confidence: 99%
“…We implemented passthrough kernels which copy pixels from one memory location to another without applying any arithmetic/logical operations. In the FPGA implementation, Xilinx's SDx tool instantiates data movers [36] for each input or output port to transfer data between the memory mapped domain and the stream domain. Table 2 shows that FPGA takes 6.945 ms to copy an entire image (1080p) with 0.41 mJ/frame, while GPU takes 1.298 ms with 0.19 mJ/frame.…”
Section: Benchmarking Approachmentioning
confidence: 99%
“…We have run the place and route process for the CP circuit for a Zynq Ultrascale+ FPGA [18] using Vivado. Our design occupies less than 0.1% of the FPGA (286 LUTs and 90 flipflops), and can achieve the frequency of 156.25 MHz that we targeted in our implementation; thus, our design can handle a new FRP message every 6.4 nanoseconds.…”
Section: Hardware Implementationmentioning
confidence: 99%
“…The FPGA cloud platform used in this design is based on Xilinx Ultrascale+. This kind of FPGA has a large amount of logic and memory resources, strong scalability, which can greatly reduce the delay of data in the calculation and transmission process, improve the design speed [12]. Finally, the design needs to coordinate the management of the processor and the hardware acceleration part.…”
Section: Introductionmentioning
confidence: 99%