Proceedings of the 2016 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays 2016
DOI: 10.1145/2847263.2847282
|View full text |Cite
|
Sign up to set email alerts
|

Automatically Optimizing the Latency, Area, and Accuracy of C Programs for High-Level Synthesis

Abstract: Loops are pervasive in numerical programs, so high-level synthesis (HLS) tools use state-of-the-art scheduling techniques to pipeline them efficiently. Still, the run time performance of the resultant FPGA implementation is limited by data dependences between loop iterations. Some of these dependence constraints can be alleviated by rewriting the program according to arithmetic identities (e.g. associativity and distributivity), memory access reductions, and control flow optimizations (e.g. partial loop unroll… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
8
0

Year Published

2017
2017
2020
2020

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 13 publications
(8 citation statements)
references
References 23 publications
0
8
0
Order By: Relevance
“…Automated source-to-source transformations can result in descriptions that might not exactly match the original code. Gao et al [13] and Cong et al [14] have done similar research.…”
Section: Related Work a Qor Improvements In Hls-based Designmentioning
confidence: 63%
“…Automated source-to-source transformations can result in descriptions that might not exactly match the original code. Gao et al [13] and Cong et al [14] have done similar research.…”
Section: Related Work a Qor Improvements In Hls-based Designmentioning
confidence: 63%
“…In general, performance analysis is mainly performed at either IR level [32,36,28,20,18] or source code level [37]. Since most of the existing work performs analysis without explicitly considering back-end design flow [32,36,18,28,20], their analysis cannot reflect the optimization done by the commercial tool. On the other hand, similar to this paper, [37] builds the performance model with the help of the commercial tool, but [37] provides neither the resource model nor automated code transformation, so users still need to manually change the kernel code while considering the FPGA resource limitation.…”
Section: Related Workmentioning
confidence: 99%
“…Finally, some frameworks also focus on general-purpose programming languages such as C/C++ [21,35,18]. SOAP3 [18] is a framework that analyzes a kernel at the metasemantic intermediate representation (MIR) graph level and transforms it according to the result of design space exploration. However, SOAP3 adopts regression models for resource estimation, so the model is not general enough to cover nonlinear resource consumption.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations