2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
DOI: 10.1109/cvpr.2017.430
Network Sketching: Exploiting Binary Structure in Deep CNNs

Abstract: Convolutional neural networks (CNNs) with deep architectures have substantially advanced the state-of-the-art in computer vision tasks. However, deep networks are typically resource-intensive and thus difficult to deploy on mobile devices. Recently, CNNs with binary weights have shown compelling efficiency to the community, whereas the accuracy of such models is usually unsatisfactory in practice. In this paper, we introduce network sketching as a novel technique of pursuing binary-weight CNNs, targeting …

Cited by 80 publications (102 citation statements). References 14 publications.
“…BNNs [23,40] propose to constrain both weights and activations to binary values (i.e., +1 and -1), where the multiply-accumulations can be replaced by pure xnor(·) and popcount(·) operations. To make a trade-off between accuracy and complexity, [13,15,29,48] propose to recursively perform residual quantization, yielding a series of binary tensors with decreasing magnitude scales. However, multiple binarizations are a sequential process that cannot be parallelized.…”
Section: Related Work
confidence: 99%
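For intuition on the xnor-and-popcount trick the quoted passage refers to, here is a minimal, self-contained sketch (not taken from the cited papers; the function name binary_dot and the bit-packing details are illustrative assumptions): a dot product between {-1,+1} vectors is recovered from the number of matching sign bits.

# Illustrative sketch, not from the cited papers: how a dot product of two
# {-1,+1} vectors reduces to XNOR and popcount on packed bit words.
import numpy as np

def binary_dot(a, w):
    """Dot product of two {-1,+1} vectors via XNOR + popcount.

    Encode +1 as bit 1 and -1 as bit 0; then
    dot(a, w) = 2 * popcount(xnor(a_bits, w_bits)) - n.
    """
    n = len(a)
    a_bits = np.packbits((a > 0).astype(np.uint8))
    w_bits = np.packbits((w > 0).astype(np.uint8))
    # XNOR of the packed words; drop padding bits beyond position n.
    xnor = np.unpackbits(~(a_bits ^ w_bits))[:n]
    matches = int(xnor.sum())            # popcount of the XNOR result
    return 2 * matches - n

a = np.random.choice([-1, 1], size=64)
w = np.random.choice([-1, 1], size=64)
assert binary_dot(a, w) == int(a @ w)

In a real BNN kernel the packed words are processed with hardware XNOR and popcount instructions, which is where the reported speed and energy savings come from.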
“…We explore the difference between layer-wise and group-wise design strategies. [This] approach can be treated as a kind of tensor approximation, which has similarities with the multiple-binarization methods in [13,15,29,30,48]; the differences are described in Sec. 4.…”
Section: Layer-wise vs. Group-wise Binary Decomposition
confidence: 99%
“…They have demonstrated the power of BNNs in terms of speed, memory use and power consumption. However, recent works such as [58,11,21,10] also reveal strong accuracy degradation and a mismatch issue during training when BNNs are applied to complicated tasks such as ImageNet [12] recognition, especially when the activations are binarized. Although some works like [43,50,13] have offered reasonable solutions for approximating full-precision neural networks, they still require much more computation and hyperparameter tuning to implement compared with BENN.…”
Section: Related Work
confidence: 99%
“…Compute activation a_l based on binary kernel w_l^b and input a_{l-1};
end
Backward Pass:
  Compute gradient ∂J/∂w_t based on [50, 28];
Parameter Update:
  Update w_t to w_{t+1} with any update rule (e.g., SGD or ADAM);
end
Ensemble Update:
  Pick the BNN when training converges;
  Use either a bagging or boosting algorithm to update the weight u_i of each training example i;
end
Return: K trained base classifiers for BENN;…”
confidence: 99%
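The pseudocode above outlines boosting over independently trained binary-weight base classifiers. As a rough, self-contained illustration of that loop structure (not the exact BENN procedure: a linear weak learner with sign-binarized weights stands in for a BNN, and the AdaBoost-style update of the example weights u_i is an assumption), consider:

# Hedged sketch of an ensemble of binary-weight base classifiers with
# boosting-style example re-weighting; this is an analogy, not BENN itself.
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 16))
y = np.sign(X @ rng.normal(size=16))          # toy labels in {-1, +1}

def fit_binary_weak_learner(X, y, u):
    # Weighted least-squares fit, then binarize the weights: w_b = alpha * sign(w).
    sw = np.sqrt(u)
    w = np.linalg.lstsq(X * sw[:, None], y * sw, rcond=None)[0]
    alpha = np.abs(w).mean()                  # L2-optimal scale for sign(w)
    return alpha * np.sign(w)

K = 5
u = np.ones(len(y)) / len(y)                  # per-example weights u_i
ensemble = []                                  # (beta_k, binarized weights)
for _ in range(K):
    w_b = fit_binary_weak_learner(X, y, u)
    pred = np.sign(X @ w_b)
    err = np.clip(u[pred != y].sum(), 1e-10, 1 - 1e-10)
    beta = 0.5 * np.log((1 - err) / err)      # vote of this base classifier
    u *= np.exp(-beta * y * pred)             # up-weight misclassified examples
    u /= u.sum()
    ensemble.append((beta, w_b))

votes = sum(b * np.sign(X @ w) for b, w in ensemble)
print("train accuracy:", float((np.sign(votes) == y).mean()))

The key point mirrored from the excerpt is the outer loop: each base classifier is trained and binarized on its own, after which the example weights u_i are updated before the next member of the ensemble is fitted.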
“…Wen et al. [40] propose a method as shown in (7), where s_t := ‖g_t‖_∞ := max(abs(g_t)) is a scalar parameter, ⊗ is the Hadamard product, and abs(·) returns the absolute value of each element. The method quantizes gradients to ternary values, which can effectively improve client-to-server communication in distributed learning. Guo et al. [41] propose greedy approximation, which instead tries to learn the quantization as shown in (8), where B_i is a binary filter, the α_i are optimization parameters, and input channels (c) × width (w) × height (h) is the size of the filter.…”
Section: Quantization Model of Convolutional Neural Network
confidence: 99%
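Equation (8) itself is not reproduced in the excerpt, but the greedy approximation it refers to, writing a real-valued filter as a weighted sum of binary filters Σ_i α_i B_i, can be sketched as follows. The function name sketch_filter and the fixed number of terms m are illustrative assumptions, and the coefficient-refinement step of the full network-sketching method is omitted.

# Hedged sketch of greedy residual binarization in the spirit of network
# sketching: W ≈ sum_i alpha_i * B_i with B_i in {-1,+1}^(c*w*h).
import numpy as np

def sketch_filter(W, m):
    """Greedily expand W into m binary filters B_i with scalar coefficients alpha_i."""
    residual = np.asarray(W, dtype=np.float64).copy()
    alphas, Bs = [], []
    for _ in range(m):
        B = np.sign(residual)
        B[B == 0] = 1                        # break ties for exactly-zero entries
        alpha = np.abs(residual).mean()      # = <B, residual> / (c*w*h)
        alphas.append(alpha)
        Bs.append(B)
        residual -= alpha * B                # the next term encodes what is left
    return alphas, Bs

W = np.random.randn(64, 3, 3)                # a filter of size c x w x h
alphas, Bs = sketch_filter(W, m=3)
approx = sum(a * B for a, B in zip(alphas, Bs))
print("relative error:", np.linalg.norm((W - approx).ravel()) / np.linalg.norm(W.ravel()))

Each step binarizes the current residual and fits a single scale, so the approximation error decreases monotonically as more binary terms are added, at the cost of the sequential dependence between binarizations noted in the first citation statement above.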