Design, Automation & Test in Europe Conference & Exhibition (DATE), 2017
DOI: 10.23919/date.2017.7927224

Understanding the impact of precision quantization on the accuracy and energy of neural networks

Abstract: Recently, the posit numerical format has shown promise for DNN data representation and compute with ultra-low precision ([5..8]-bit). However, the majority of studies focus only on DNN inference. In this work, we propose DNN training using posits and compare it with floating-point training. We evaluate on both the MNIST and Fashion MNIST datasets, where 16-bit posits outperform 16-bit floating point for end-to-end DNN training.
Index Terms: Deep neural networks, low-precision arithmetic, posit numerical format
I. INTROD…
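To make the data representation concrete: a posit(n, es) value encodes a sign, a variable-length regime, up to es exponent bits, and a fraction. The sketch below simulates posit quantization by decoding every n-bit posit pattern into a value table and rounding inputs to the nearest entry. It is a minimal illustration, not the paper's training setup; the es = 1 default, the helper names, and the round-to-nearest-value tie handling (the posit standard specifies round-to-nearest-even) are assumptions.

```python
import numpy as np

def decode_posit(bits, n=16, es=1):
    """Decode an n-bit posit bit pattern (an unsigned int) to a float.
    Returns 0.0 for the zero pattern and NaN for the NaR pattern."""
    if bits == 0:
        return 0.0
    if bits == 1 << (n - 1):                     # NaR ("not a real")
        return float("nan")
    sign = -1.0 if bits >> (n - 1) else 1.0
    if sign < 0:                                 # negative posits are stored in two's complement
        bits = (1 << n) - bits
    body = format(bits & ((1 << (n - 1)) - 1), f"0{n - 1}b")   # bits after the sign
    run = len(body) - len(body.lstrip(body[0]))  # regime = leading run of identical bits
    k = run - 1 if body[0] == "1" else -run
    rest = body[run + 1:]                        # skip the regime terminator bit
    exp_bits, frac_bits = rest[:es], rest[es:]
    e = int(exp_bits, 2) << (es - len(exp_bits)) if exp_bits else 0   # missing exponent bits are zero
    frac = int(frac_bits, 2) / (1 << len(frac_bits)) if frac_bits else 0.0
    return sign * 2.0 ** (k * (1 << es) + e) * (1.0 + frac)

def quantize_to_posit(x, n=16, es=1):
    """Round each element of x to the nearest representable posit(n, es) value.
    Rebuilds the value table on every call; cache it in real use."""
    vals = np.array([decode_posit(p, n, es) for p in range(1 << n)])
    table = np.sort(vals[~np.isnan(vals)])
    x = np.asarray(x, dtype=np.float64)
    idx = np.clip(np.searchsorted(table, x), 1, len(table) - 1)
    lo, hi = table[idx - 1], table[idx]
    return np.where(np.abs(x - lo) <= np.abs(hi - x), lo, hi)

print(quantize_to_posit(np.array([0.1, -3.14159, 1e4])))
```

In a training simulation, a quantizer like this would be applied to weights and activations between otherwise full-precision operations, which is a common way to study low-precision formats without dedicated hardware.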

Cited by 92 publications (71 citation statements)
References 30 publications
“…8-bit) to high-precision floating point (e.g. 32-bit) [14]. However, these works compare numerical formats with disparate bit-widths and thereby do not fairly provide a comprehensive, holistic study of the network efficiency.…”
Section: Introduction (mentioning)
confidence: 99%
“…After decoding inputs, multiplication and converting to fixed-point is performed similarly to that of floating point. Products are accumulated in a register, or quire in the posit literature, of width $q_{size}$ as given by (4):

$q_{size} = 2^{es+2} \times (n - 2) + 2 + \lceil \log_2(k) \rceil, \quad n \geq 3 \qquad (4)$ …”
Section: Posit EMAC (mentioning)
confidence: 99%
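For a quick sanity check of Eq. (4), the helper below evaluates the quire width for a couple of (n, es, k) configurations; the function name and the example parameter values are illustrative assumptions, not figures from the cited work.

```python
import math

def quire_width(n, es, k):
    """Quire (accumulator) width from Eq. (4): wide enough to sum k exact
    products of n-bit posits with es exponent bits; valid for n >= 3."""
    assert n >= 3
    return 2 ** (es + 2) * (n - 2) + 2 + math.ceil(math.log2(k))

# e.g. accumulating k = 256 products of 8-bit (es = 0) or 16-bit (es = 1) posits
print(quire_width(8, 0, 256))    # 4*6 + 2 + 8 = 34 bits
print(quire_width(16, 1, 256))   # 8*14 + 2 + 8 = 122 bits
```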
“…8-bit) to a high-precision floating point (e.g. 32-bit) [4]. The utility of these studies is limited: the comparisons are across numerical formats with different bit widths and do not provide a fair understanding of the overall system efficiency.…”
Section: Introduction (mentioning)
confidence: 99%
“…The most popular approach is to introduce approximate computing techniques to CNNs and benefit from the fact that the applications utilizing CNNs are highly error resilient (i.e., a huge reduction in energy consumption can be obtained for an acceptable loss in accuracy) [7]. Approximate implementations of CNNs are based on various techniques such as innovative hardware architectures of CNN accelerators, simplified data representation, pruning of less significant neurons, approximate arithmetic operations, approximate memory access, weight compression and "in memory" computing [11,7,2]. For example, employing the FX operations has many advantages such as reduced (i) power consumption per arithmetic operation, (ii) memory capacity needed to store the weights and (iii) processor-memory data transfer time.…”
Section: Related Work (mentioning)
confidence: 99%
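As a rough illustration of the fixed-point (FX) representation the quoted work contrasts with floating point, the sketch below snaps a weight array to a signed fixed-point grid with a given number of fractional bits and saturates out-of-range values. It is a generic sketch under assumed parameters (8-bit word, 6 fractional bits), not the quantization scheme of any cited paper, and np.round's ties-to-even behavior is a simplification.

```python
import numpy as np

def quantize_fixed_point(w, total_bits=8, frac_bits=6):
    """Quantize w to signed fixed-point with `frac_bits` fractional bits
    (step 2**-frac_bits), saturating to the representable range."""
    step = 2.0 ** -frac_bits
    qmin = -(2 ** (total_bits - 1)) * step         # most negative code
    qmax = (2 ** (total_bits - 1) - 1) * step      # most positive code
    return np.clip(np.round(np.asarray(w) / step) * step, qmin, qmax)

w = np.array([0.731, -0.052, 1.9, -2.4])
print(quantize_fixed_point(w))   # snapped to multiples of 1/64, clipped to [-2, 1.984375]
```

Storing such codes as 8-bit integers (rather than 32-bit floats) is what yields the memory, bandwidth, and per-operation energy savings the quote lists.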
“…Our objective is to design and optimize not only with respect to the classification error, but also with respect to hardware resources needed when the final (trained) CNN is implemented in an embedded system with limited resources. As energy-efficient machine learning is a highly desired technology, various approximate implementations of CNNs have been introduced [7,2]. Contrasted to the existing neuroevolutionary approaches trying to minimize the classification error as much as possible and assuming that CNN is executed using floating point (FP) operations on a Graphical Processing Unit (GPU) [3,1], our target is a highly optimized CNN whose major parts are executed with reduced precision in fixed point (FX) arithmetic operations.…”
Section: Introduction (mentioning)
confidence: 99%