2022
DOI: 10.1145/3510026

Block Walsh–Hadamard Transform-based Binary Layers in Deep Neural Networks

Abstract: Convolution has been the core operation of modern deep neural networks. It is well-known that convolutions can be implemented in the Fourier Transform domain. In this paper, we propose to use binary block Walsh-Hadamard transform (WHT) instead of the Fourier transform. We use WHT-based binary layers to replace some of the regular convolution layers in deep neural networks. We utilize both one-dimensional (1-D) and two-dimensional (2-D) binary WHTs in this paper. In both 1-D and 2-D layers, we compute the binar…
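For context on the transform the abstract refers to, the following is a minimal Python sketch of a generic fast Walsh-Hadamard transform. It is an illustration of the underlying transform only, not the authors' layer implementation; the function name and the in-place butterfly formulation are my own.

```python
import numpy as np

def fwht(x):
    """Fast Walsh-Hadamard transform (Hadamard/natural order) of a 1-D array.

    Generic O(n log n) butterfly implementation for illustration only;
    the input length must be a power of two.
    """
    y = np.asarray(x, dtype=np.float64).copy()
    n = y.shape[0]
    if n & (n - 1) != 0:
        raise ValueError("length must be a power of two")
    h = 1
    while h < n:
        for i in range(0, n, 2 * h):
            for j in range(i, i + h):
                a, b = y[j], y[j + h]
                y[j], y[j + h] = a + b, a - b
        h *= 2
    return y

# Example: fwht([1, 0, 1, 0]) -> array([2., 2., 0., 0.])
```

Each stage of the loop doubles the butterfly width, which is why only power-of-two lengths are handled directly; the block partitioning discussed in the citation statements below exists precisely to lift that restriction.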

Citations: Cited by 9 publications (3 citation statements)
References: 42 publications
“…WHT presents a computational challenge when the dimension of the input vector is not a power of two. A technique called blockwise WHTs (BWHTs) was introduced to address this issue in [28]. The BWHT approach divides the transform matrix into multiple blocks, each sized to an integer power of two.…”
Section: Background, A. Walsh-Hadamard Transform (mentioning)
Confidence: 99%
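As a rough illustration of the blocking idea described in this statement, the sketch below splits an arbitrary-length input into fixed power-of-two blocks and applies the WHT to each block. The block size of 64 and the zero-padding of the final block are assumptions made for the example, not necessarily the exact scheme used in [28].

```python
import numpy as np
from scipy.linalg import hadamard

def block_wht(x, block_size=64):
    """Blockwise Walsh-Hadamard transform for inputs of arbitrary length.

    The signal is partitioned into `block_size`-long segments (zero-padded at
    the end) and each segment is multiplied by a Hadamard matrix of that size.
    Returns an array of shape (n_blocks, block_size) with per-block coefficients.
    """
    if block_size & (block_size - 1) != 0:
        raise ValueError("block size must be a power of two")
    x = np.asarray(x, dtype=np.float64)
    n_blocks = -(-x.size // block_size)          # ceil(len(x) / block_size)
    padded = np.zeros(n_blocks * block_size)
    padded[:x.size] = x
    blocks = padded.reshape(n_blocks, block_size)
    H = hadamard(block_size).astype(np.float64)  # block-sized Hadamard matrix
    return blocks @ H                            # WHT of each block (H is symmetric)
```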
“…On the other hand, the projection operation employs a 1-D-BWHT layer to reduce the dimensionality and make the network computationally efficient while retaining essential features. In Pan et al.'s study [28], these transformations maintained matching accuracy under frequency transforms while achieving significantly greater compression than the standard implementation on benchmark datasets such as CIFAR-10, CIFAR-100, and ImageNet. The number of parameters in the BWHT layer is thus proportional to the thresholding parameter T, which is significantly smaller than the number of parameters in a 1 × 1 convolution layer.…”
Section: B. Frequency-Domain Compression of Deep Neural Network (mentioning)
Confidence: 99%
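To make the role of the thresholding parameter T more concrete, the sketch below runs a transform-threshold-inverse-transform pipeline on a single power-of-two block. The soft-thresholding form and the scaling of T relative to the largest coefficient are assumptions for illustration; in the actual layer T would be a learned parameter, and the transform would be applied blockwise as in the previous sketch.

```python
import numpy as np
from scipy.linalg import hadamard

def soft_threshold(y, t):
    """Shrink coefficients toward zero by t (an assumed stand-in for the layer's thresholding)."""
    return np.sign(y) * np.maximum(np.abs(y) - t, 0.0)

def wht_threshold_block(x, T=0.1):
    """Forward WHT, threshold small coefficients, inverse WHT, for one block.

    `x` must have power-of-two length; `T` plays the role of the (learnable)
    thresholding parameter referenced in the citation.
    """
    x = np.asarray(x, dtype=np.float64)
    n = x.size
    H = hadamard(n).astype(np.float64)
    y = H @ x                                      # forward WHT
    y = soft_threshold(y, T * np.max(np.abs(y)))   # suppress small coefficients
    return (H @ y) / n                             # inverse WHT (H @ H == n * I)
```

Because the trainable state reduces to the threshold values rather than a full weight matrix, the parameter count of such a layer stays far below that of a 1 × 1 convolution, which is the point the statement above makes.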