Proceedings of the ACM International Conference on Parallel Architectures and Compilation Techniques 2020
DOI: 10.1145/3410463.3414655
SparseTrain

Abstract: Our community has improved the efficiency of deep learning applications by exploiting sparsity in inputs. Most of that work, though, targets inference, where weight sparsity is known statically, and/or specialized hardware. In this paper, we propose SparseTrain, a software-only scheme that leverages dynamic sparsity during training on general-purpose SIMD processors. SparseTrain exploits the zeros that the ReLU activation function introduces in both feature maps and their gradients. Exploiting such sparsity is challenging…
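The core idea lends itself to a short illustration. Below is a minimal scalar sketch, not SparseTrain's actual vectorized kernel, of how a convolution can skip ReLU-induced zeros whose positions change at every training step: the loop is organized around inputs rather than outputs, so each zero input is discarded with a single test. The function name, the 1-D shape, and the dense layout are all illustrative assumptions.

```c
#include <stddef.h>

/* Hypothetical sketch, not the paper's kernel: a 1-D convolution that
 * walks over inputs and skips any input that ReLU zeroed out.
 * out must have room for n - k + 1 elements. */
static void conv1d_skip_relu_zeros(const float *in, size_t n,
                                   const float *w, size_t k,
                                   float *out)
{
    if (k == 0 || n < k)
        return;
    for (size_t i = 0; i <= n - k; i++)
        out[i] = 0.0f;

    for (size_t j = 0; j < n; j++) {
        float x = in[j];
        if (x == 0.0f)      /* dynamic sparsity: drop all work for this input */
            continue;
        /* in[j] contributes to out[i] for i in [max(0, j-k+1), min(n-k, j)],
         * scaled by w[j - i]. */
        size_t lo = (j + 1 > k) ? j + 1 - k : 0;
        size_t hi = (j < n - k) ? j : n - k;
        for (size_t i = lo; i <= hi; i++)
            out[i] += x * w[j - i];
    }
}
```

Because ReLU's derivative is zero exactly where its output was zero, the same input-stationary skipping applies to the gradient maps in the backward pass as well, which is why the abstract mentions both feature maps and their gradients.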


Cited by 13 publications
References 29 publications