2020
DOI: 10.1109/tnnls.2019.2910073
Compact and Computationally Efficient Representation of Deep Neural Networks

Abstract: At the core of any inference procedure in deep neural networks are dot product operations, which are the components that require the highest computational resources. For instance, deep neural networks such as VGG-16 require up to 15 gigaoperations to perform the dot products in a single forward pass, which results in significant energy consumption and therefore limits their use in resource-limited environments, e.g., on embedded devices or smartphones. A common approach to reduce the cost of inf…
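To put the "up to 15 gigaoperations" figure in context, here is a back-of-envelope sketch (a rough estimate assuming the standard VGG-16 layer configuration for a 224×224 input; the layer shapes below are not taken from this page) that counts the multiply-accumulate operations behind those dot products:

```python
# Rough MAC count for a VGG-16-style forward pass (224x224 input).
# Layer shapes are the standard VGG-16 configuration, assumed here for illustration.
conv_layers = [
    # (output_h, output_w, in_channels, out_channels), all 3x3 kernels, stride 1, 'same' padding
    (224, 224,   3,  64), (224, 224,  64,  64),
    (112, 112,  64, 128), (112, 112, 128, 128),
    ( 56,  56, 128, 256), ( 56,  56, 256, 256), ( 56,  56, 256, 256),
    ( 28,  28, 256, 512), ( 28,  28, 512, 512), ( 28,  28, 512, 512),
    ( 14,  14, 512, 512), ( 14,  14, 512, 512), ( 14,  14, 512, 512),
]
fc_layers = [(25088, 4096), (4096, 4096), (4096, 1000)]

# Each output pixel of a 3x3 convolution is a dot product of length in_channels * 9.
conv_macs = sum(h * w * c_out * (c_in * 9) for h, w, c_in, c_out in conv_layers)
fc_macs = sum(n_in * n_out for n_in, n_out in fc_layers)

print(f"conv MACs : {conv_macs / 1e9:.1f} G")   # ~15.3 G
print(f"fc   MACs : {fc_macs / 1e9:.2f} G")     # ~0.12 G
print(f"total     : {(conv_macs + fc_macs) / 1e9:.1f} G multiply-accumulates per forward pass")
```

The convolutional layers account for nearly all of the roughly 15 billion multiply-accumulates, which is why compact weight representations that cheapen the dot product are the focus of this line of work.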

Cited by 64 publications (43 citation statements)
References 40 publications
"…These matrix data structures not only offer compression gains but also enable efficient execution of the associated dot product algorithm [54]. Similarly, [14] proposed two novel matrix representations, the Compressed Entropy Row (CER) and Compressed Shared Elements Row (CSER), which are provably superior to CSR with regard to both compression and execution efficiency when the network parameters have low-entropy statistics.…"
Section: B. Lossless Neural Network Compression
confidence: 99%
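As a rough illustration of why such row-wise representations can save both memory and arithmetic, the sketch below (plain Python/NumPy, not the CER/CSER implementation from [14]) contrasts a standard CSR dot product with a "shared elements" variant that multiplies each distinct weight value only once per row; with low-entropy, heavily quantized weights, most multiplications turn into additions:

```python
import numpy as np

def csr_matvec(values, col_idx, row_ptr, x):
    """Standard CSR sparse matrix-vector product: one multiply per stored nonzero."""
    y = np.zeros(len(row_ptr) - 1)
    for i in range(len(row_ptr) - 1):
        for k in range(row_ptr[i], row_ptr[i + 1]):
            y[i] += values[k] * x[col_idx[k]]
    return y

def shared_element_matvec(W, x):
    """Illustrative 'shared elements' dot product: per row, group columns by their
    (quantized) weight value, accumulate the matching activations first, and
    multiply each distinct value only once. With few distinct weight values,
    this trades multiplications for additions."""
    y = np.zeros(W.shape[0])
    for i, row in enumerate(W):
        partial = {}
        for j, w in enumerate(row):
            if w != 0:
                partial[w] = partial.get(w, 0.0) + x[j]  # sum activations per distinct weight
        y[i] = sum(w * s for w, s in partial.items())     # one multiply per distinct value
    return y

if __name__ == "__main__":
    W = np.array([[0.5, 0.0, 0.5, -1.0],
                  [0.0, 0.5, 0.5,  0.5]])
    x = np.array([1.0, 2.0, 3.0, 4.0])
    print(shared_element_matvec(W, x), W @ x)  # the two results should agree
```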
“…The computational cost of the in-place additions is negligible. Note that model compression [23,24] and efficient representations [25] can further reduce the computational costs. Fig.…”
Section: Multi-Kernel Prediction Network
confidence: 99%
"…(a) input burst, (b) KPN [14], (c) KPN L25, (d) MKPN, (e) ground truth. Example of denoising an image of a bear at Gain ∝ 4. The detailed fur is recovered best by MKPN.…"
confidence: 99%
"…[24] Complementary to work on higher-precision efficient hardware implementation, as presented here, efforts on improving the performance of low-precision networks have shown considerable progress recently [25][26][27]. Currently, these methods require off-chip processing during training and do not target online on-device learning in neuromorphic hardware.…"
Section: Introduction
confidence: 99%