Proceedings of the 2nd ACM SIGPLAN International Workshop on Machine Learning and Programming Languages 2018
DOI: 10.1145/3211346.3211348
Relay: a new IR for machine learning frameworks

Abstract: Machine learning powers diverse services in industry including search, translation, recommendation systems, and security. The scale and importance of these models require that they be efficient, expressive, and portable across an array of heterogeneous hardware devices. These constraints are often at odds; in order to better accommodate them we propose a new high-level intermediate representation (IR) called Relay. Relay is being designed as a purely-functional, statically-typed language with the goal of balan…

Cited by 83 publications (35 citation statements)
References 26 publications
“…In the deep learning compiler TVM [10], the high-level intermediate representation (graph IR) resides in the front-end, and the low-level intermediate representation (tensor IR) resides in the back-end, but is relatively traditional. TVM can better obtain the overall information of the application and complete specific optimizations (such as graph optimization) for deep learning.…”
Section: TVM Compilation Architecture and Matrix-DSP Analysis, 2.1 TVM
confidence: 99%
“…TVM [21] is an optimizing compiler architecture with a large open source community and probably the highest number of supported hardware architectures. It features integration for TensorFlow and PyTorch, but so far only supports inference workloads (their high level IR "Relay" already supports training but not the lower level implementations).…”
Section: Optimizing Compilers and Middleware
confidence: 99%
“…To explore hardware-software splits, we begin with ML workloads written in Relay, which is the intermediate representation used by the TVM compiler [1]. Relay represents a machine learning workload as a series of kernel calls, but does not make explicit the underlying hardware and software components described above.…”
Section: Overview of Solution
confidence: 99%
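The last citation describes Relay as representing a machine learning workload as a series of kernel calls. The sketch below is a toy model of that view only, not the actual TVM/Relay API: the names `KernelCall` and `Workload` are hypothetical, introduced solely to illustrate how a workload can be encoded as an ordered sequence of named kernel invocations.

```python
from dataclasses import dataclass, field

# Illustrative only: a toy IR in the spirit of the "series of kernel
# calls" view of Relay. KernelCall and Workload are hypothetical names,
# not part of TVM.

@dataclass(frozen=True)
class KernelCall:
    op: str          # kernel name, e.g. "dense" or "relu"
    inputs: tuple    # names of the input values
    output: str      # name bound to this call's result

@dataclass
class Workload:
    calls: list = field(default_factory=list)

    def add(self, op, inputs, output):
        """Append one kernel call and return the name of its result."""
        self.calls.append(KernelCall(op, tuple(inputs), output))
        return output

# A two-layer MLP expressed as a linear sequence of kernel calls.
mlp = Workload()
h = mlp.add("dense", ["x", "w1"], "h")
a = mlp.add("relu", [h], "a")
y = mlp.add("dense", [a, "w2"], "y")

ops = [c.op for c in mlp.calls]
```

In this encoding the hardware/software split the authors explore is not yet explicit, which matches the citation's point: the kernel-call sequence says *what* to compute, while the mapping of each call to hardware or software components is decided separately.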