2020
DOI: 10.1109/jxcdc.2020.2992306

A Relaxed Quantization Training Method for Hardware Limitations of Resistive Random Access Memory (ReRAM)-Based Computing-in-Memory

Abstract: Nonvolatile computing-in-memory (nvCIM) exhibits high potential for neuromorphic computing involving massive parallel computations and for achieving high energy efficiency. nvCIM is especially suitable for deep neural networks, which are required to perform large amounts of matrix-vector multiplications. However, a comprehensive quantization algorithm has yet to be developed, which overcomes the hardware limitations of resistive random access memory (ReRAM)-based nvCIM, such as the number of I/O, word lines (W…
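To make the in-memory matrix-vector multiplication described in the abstract concrete, the following is a minimal sketch, not the paper's actual design: weights are mapped to a small set of conductance levels, only a limited group of word lines is activated at a time, and each bit-line partial sum is digitized by a low-resolution ADC. The function name, level counts, and the 36-row access group (taken from one of the citing excerpts below) are illustrative assumptions.

```python
import numpy as np

def reram_mvm(x, W, n_weight_levels=8, adc_bits=4, rows_per_access=36):
    """Toy model of a ReRAM CIM matrix-vector multiply (illustrative only).

    Weights are quantized to a few conductance levels, inputs drive a limited
    group of word lines at a time, and each bit-line partial sum is digitized
    by a low-resolution ADC before the partial results are added digitally.
    """
    # Quantize weights to discrete conductance levels in [0, 1].
    W_q = np.round(np.clip(W, 0.0, 1.0) * (n_weight_levels - 1)) / (n_weight_levels - 1)

    out = np.zeros(W.shape[1])
    adc_levels = 2 ** adc_bits
    for start in range(0, W.shape[0], rows_per_access):
        block = slice(start, start + rows_per_access)
        # Analog accumulation along the bit line for the active word lines.
        partial = x[block] @ W_q[block]
        # ADC: clip and quantize the bit-line result to adc_bits of resolution.
        full_scale = rows_per_access  # assumes binary inputs, worst-case sum
        code = np.round(np.clip(partial / full_scale, 0.0, 1.0) * (adc_levels - 1))
        out += code / (adc_levels - 1) * full_scale
    return out

# Example: 128 binary inputs, 16 output columns.
rng = np.random.default_rng(0)
x = rng.integers(0, 2, size=128).astype(float)
W = rng.random((128, 16))
print(reram_mvm(x, W)[:4])
```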

Cited by 11 publications (6 citation statements). References 10 publications.
“…Although the IMC technology achieves high energy efficiency, when DNNs trained in software are deployed on IMC hardware, accuracy degradation can occur due to limited ADC precision, variations in the IMC devices, ambient conditions, and transistor non-linearity [11,13,19,25,27,28]. Several recent works have attempted to address these particular issues, and representative NVM-based works are described below.…”
Section: Hardware-Aware DNN Training for Accurate DNN Inference with ...
Mentioning confidence: 99%
“…However, additional area and energy overhead are incurred in the IMC hardware due to the addition of thermal reference cells. A quantization-aware DNN training scheme was proposed in [28] which considered input and weight quantization, RRAM-based convolution, and ADC quantization. However, only up to 36 rows are activated simultaneously for IMC to limit the accuracy degradation, and still, >2% accuracy loss is reported for the CIFAR-10 dataset.…”
Section: Hardware-Aware DNN Training for Accurate DNN Inference with ...
Mentioning confidence: 99%
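A minimal sketch of the quantization-aware training idea referenced in this excerpt, assuming a generic straight-through-estimator (STE) fake-quantization of inputs, weights, and accumulated outputs; it is not the exact scheme of [28], and the bit widths and output scaling below are illustrative assumptions.

```python
import torch

class FakeQuant(torch.autograd.Function):
    """Uniform fake quantization with a straight-through estimator (STE)."""

    @staticmethod
    def forward(ctx, x, n_bits):
        levels = 2 ** n_bits - 1
        return torch.round(x.clamp(0.0, 1.0) * levels) / levels

    @staticmethod
    def backward(ctx, grad_output):
        # STE: pass the gradient straight through the non-differentiable rounding.
        return grad_output, None

def quantized_linear(x, weight, in_bits=4, w_bits=3, adc_bits=4):
    """Linear layer with fake-quantized inputs, weights, and accumulated output."""
    x_q = FakeQuant.apply(x, in_bits)
    w_q = FakeQuant.apply(weight, w_bits)
    y = x_q @ w_q
    # Emulate the ADC by also quantizing the accumulated (normalized) output.
    return FakeQuant.apply(y / y.detach().abs().max().clamp(min=1e-6), adc_bits)

x = torch.rand(8, 64, requires_grad=True)
w = torch.rand(64, 10, requires_grad=True)
out = quantized_linear(x, w)
out.sum().backward()          # gradients flow thanks to the STE
print(out.shape, w.grad.shape)
```

The STE is a common workaround for the non-differentiable rounding step; training against this quantized forward path is what lets the network adapt to the precision it will actually see on the CIM hardware.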
“…Quantization is a discrete process, making it a suitable candidate to be learned using a Gumbel-based estimator. The authors of [61], [62] propose to learn the quantization levels, which form a quantization codebook, whereas the authors of [63] learn to select such a codebook as a whole. Moreover, data-adaptive binarization has been learned as well [64], [65], [66].…”
Section: Data Compression
Mentioning confidence: 99%
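As a rough illustration of the Gumbel-based codebook learning described in this excerpt (a sketch of the general technique, not the specific methods of [61]-[63]): each value is softly assigned to one of a small set of learnable quantization levels via the Gumbel-softmax relaxation, so the discrete assignment stays differentiable and the codebook itself receives gradients. The codebook size and the distance-based logits are assumptions for illustration.

```python
import torch
import torch.nn.functional as F

def gumbel_codebook_quantize(x, codebook, tau=1.0, hard=True):
    """Quantize x by a Gumbel-softmax selection over a learnable codebook.

    The Gumbel-softmax relaxation keeps the discrete level assignment
    differentiable, so the quantization levels can be learned end to end.
    """
    # Logits favour codebook entries close to the input value.
    logits = -(x.unsqueeze(-1) - codebook) ** 2
    one_hot = F.gumbel_softmax(logits, tau=tau, hard=hard, dim=-1)
    return one_hot @ codebook

codebook = torch.nn.Parameter(torch.linspace(0.0, 1.0, 8))   # 8 learnable levels
x = torch.rand(4, 16)
x_q = gumbel_codebook_quantize(x, codebook)
x_q.sum().backward()
print(codebook.grad.shape)   # the quantization levels receive gradients
```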
“…While effective in reducing ADC resolution requirements, bit-slicing results in severe area overhead. The work in [14] …”
Section: Introduction
Mentioning confidence: 99%
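For context on the bit-slicing trade-off mentioned in this excerpt, here is a hedged sketch (illustrative, not the scheme of [14]): an integer weight matrix is split into binary bit slices stored in separate columns, each slice's partial sum is digitized by a low-resolution ADC, and a digital shift-and-add recombines the slices. The extra slices are where the area overhead comes from; bit widths and array sizes below are assumed values.

```python
import numpy as np

def bit_sliced_mvm(x, W_int, w_bits=4, adc_bits=4):
    """Bit-sliced matrix-vector multiply (illustrative sketch).

    The integer weight matrix is split into w_bits binary slices stored in
    separate columns.  Each slice only produces small partial sums, so a
    low-resolution ADC suffices; a digital shift-and-add restores the result.
    """
    out = np.zeros(W_int.shape[1])
    adc_max = 2 ** adc_bits - 1
    for b in range(w_bits):
        slice_b = (W_int >> b) & 1                         # binary slice for bit b
        partial = x @ slice_b                              # analog accumulation per slice
        partial = np.clip(np.round(partial), 0, adc_max)   # low-resolution ADC
        out += partial * (2 ** b)                          # digital shift-and-add
    return out

rng = np.random.default_rng(1)
x = rng.integers(0, 2, size=32).astype(float)     # binary inputs
W_int = rng.integers(0, 16, size=(32, 8))          # 4-bit integer weights
print(bit_sliced_mvm(x, W_int))
print(x @ W_int)   # ideal result; differs where the low-resolution ADC clips a slice
```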