The RACECAR heuristic for automatic function specialization on multi-core heterogeneous systems

Wernsing, John; Stitt, Greg; Fowers, Jeremy

doi:10.1145/2380403.2380423

Cited by 6 publications

(3 citation statements)

References 29 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…For instance, a different concept of elasticity named elastic computing has been introduced [8] which supports portable designs (the ability to run one function on multiple platforms) but augmented with a selection model which can choose the best available implementation based on application properties, and pre-compiled performance models. The authors extend their work in [9] which introduces a heuristic for generating parallel implementations for heterogeneous computers.…”

Section: Background and Related Workmentioning

confidence: 99%

Elastic Management of Reconfigurable Accelerators

Grigoras

Tottenham

Niu

et al. 2014

2014 IEEE International Symposium on Parallel and Distributed Processing With Applications

View full text Add to dashboard Cite

This paper presents a runtime system for reconfigurable accelerators that supports elastic management: it enables effective sharing of accelerator resources across multiple applications. For each application, this runtime system allocates an appropriate amount of resources to satisfy its quality-of-service requirements, while minimising the overall execution time for a collection of applications. The effectiveness of this runtime system is due to a set of scheduling algorithms and strategies customised for different types of workloads. We demonstrate our approach by implementing a dynamic Monte Carlo bond options pricing design.

show abstract

Section: Background and Related Workmentioning

confidence: 99%

Elastic Management of Reconfigurable Accelerators

Grigoras

Tottenham

Niu

et al. 2014

2014 IEEE International Symposium on Parallel and Distributed Processing With Applications

View full text Add to dashboard Cite

show abstract

“…Performance information on each function is made available by implementation performance graphs (IPGs) similar to those described in [31], although any form of performance prediction could potentially be used. IPGs contain the execution times for a function implementation at a variety of input sizes.…”

Section: Scopes 2014mentioning

confidence: 99%

“…Lastly, our implementation of the scheduler does not consider intra-function parallelization optimizations, such as parallelizing a single function across multiple CPU cores using divide-andconquer techniques (e.g., [31]). Such optimizations are compatible with our approach and will be merged with our framework in the future.…”

Section: Limitations and Future Workmentioning

confidence: 99%

A framework for dynamic parallelization of FPGA-accelerated applications

Fowers

Liu

Stitt

2014

Proceedings of the 17th International Workshop on Software and Compilers for Embedded Systems

Self Cite

View full text Add to dashboard Cite

High-level synthesis and compiler studies have introduced many compile-time techniques for parallelizing applications. However, one fundamental limitation of compile-time optimization is the requirement for pessimistic dependence assumptions that can significantly restrict parallelism. To avoid this limitation, many compilers require a restrictive coding style that is not practical for many designers. We present a more transparent approach that aggressively parallelizes applications by dynamically analyzing actual runtime dependencies and scheduling functions onto multiple devices when dependencies allow. In addition, the approach applies FPGA-specific pipelining optimizations to exploit deep parallelism in chains of dependent functions. Experimental results show a speedup of 4.9x for a videoprocessing application compared to sequential software execution, a speedup of 5.6x compared to traditional FPGA execution, with a framework overhead of only 4%.

show abstract

An interpolation-based approach to multi-parameter performance modeling for heterogeneous systems

Rudolph

Stitt

2015

2015 IEEE 26th International Conference on Application-Specific Systems, Architectures and Processors (ASAP)

View full text Add to dashboard Cite

The RACECAR heuristic for automatic function specialization on multi-core heterogeneous systems

Cited by 6 publications

References 29 publications

Elastic Management of Reconfigurable Accelerators

Elastic Management of Reconfigurable Accelerators

A framework for dynamic parallelization of FPGA-accelerated applications

An interpolation-based approach to multi-parameter performance modeling for heterogeneous systems

Contact Info

Product

Resources

About