2018 IEEE 20th International Workshop on Multimedia Signal Processing (MMSP) 2018
DOI: 10.1109/mmsp.2018.8547134
Near-Lossless Deep Feature Compression for Collaborative Intelligence

Abstract: Collaborative intelligence is a new paradigm for efficient deployment of deep neural networks across the mobile-cloud infrastructure. By dividing the network between the mobile and the cloud, it is possible to distribute the computational workload such that the overall energy and/or latency of the system is minimized. However, this necessitates sending deep feature data from the mobile to the cloud in order to perform inference. In this work, we examine the differences between the deep feature data and natural …

Cited by 63 publications (60 citation statements)
References 18 publications (36 reference statements)
“…Furthermore, pushing all computations toward the cloud can lead to congestion in a scenario where a large number of mobile devices simultaneously send data to the cloud. As a compromise between the mobile-only and the cloud-only approach, recently, a body of research work has been investigating the idea of splitting a deep inference network between the mobile and cloud [6][7][8][9][10][11][12]. In this approach, which is referred to as collaborative intelligence, the computations associated with initial layers of the inference network are performed on the mobile device, and the feature tensor (activations) of the last computed layer is sent to the cloud for the remainder of computations.…”
Section: Introduction
confidence: 99%
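The split-inference workflow quoted above (initial layers on the mobile device, the feature tensor of the last computed layer sent to the cloud for the remaining layers) can be sketched in a few lines. This is a toy numpy model, not the paper's actual network; the layer weights and split point are hypothetical:

```python
import numpy as np

rng = np.random.default_rng(0)

# A toy "deep network": four ReLU layers with hypothetical random weights.
weights = [rng.standard_normal((8, 8)) for _ in range(4)]
layers = [lambda x, W=W: np.maximum(0.0, x @ W) for W in weights]

def run(layer_list, x):
    for f in layer_list:
        x = f(x)
    return x

split = 2  # hypothetical split point between mobile and cloud
x = rng.standard_normal((1, 8))

# Mobile side: compute the first `split` layers, producing the feature tensor
# (activations) that would be transmitted to the cloud.
features = run(layers[:split], x)

# Cloud side: finish inference from the received features.
cloud_out = run(layers[split:], features)

# Splitting the computation does not change the inference result.
full_out = run(layers, x)
assert np.allclose(cloud_out, full_out)
```

The point of the sketch is that, absent compression, the split is exact: the cloud's output matches end-to-end inference, so the only costs are the transmission of `features` and the division of compute.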
“…In research studies investigating collaborative intelligence, a given deep network is split between the mobile device and the cloud without any modification to the network architecture itself [6,[8][9][10][11][12]. In this paper, we investigate altering the underlying deep model architecture to make it collaborative intelligence friendly.…”
Section: Introduction
confidence: 99%
“…However, this approach can cause congestion problems due to the growth of the volume of data transmitted over the network and the number of devices linked to the cloud. To address this problem, recent studies in collaborative intelligence have developed optimized deployment strategies [2][3][4][5][6][7].…”
Section: Introduction
confidence: 99%
“…Furthermore, considering the impact of the volume of data on the congestion problem, it is desirable to compress and transfer a lesser volume of data to the cloud, unless the inference performance loss is large [3]. Previous studies [4,5] have explored the efficacy of compressing deep feature tensors using conventional standard codecs, in the context of object detection and image classification. Also, [6] suggests a method to first reduce the dimensionality of the deep feature tensor, then compress it using a codec.…”
Section: Introduction
confidence: 99%
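The near-lossless feature-compression idea discussed in the quote above can be illustrated with a minimal sketch: uniform 8-bit quantization (lossy, but with a bounded error) followed by lossless entropy coding. Here `zlib` stands in for a conventional standard codec, and the feature tensor is synthetic; none of this is the paper's actual pipeline:

```python
import zlib
import numpy as np

rng = np.random.default_rng(1)
# Synthetic post-ReLU feature tensor (non-negative, like typical activations).
features = np.maximum(0.0, rng.standard_normal((32, 32, 8))).astype(np.float32)

# Mobile side: uniform 8-bit quantization of the feature range.
fmin, fmax = float(features.min()), float(features.max())
scale = (fmax - fmin) / 255.0 or 1.0
q = np.round((features - fmin) / scale).astype(np.uint8)

# Lossless stage: entropy-code the quantized tensor for transmission.
bitstream = zlib.compress(q.tobytes(), level=9)

# Cloud side: decode and dequantize before resuming inference.
dq = np.frombuffer(zlib.decompress(bitstream), dtype=np.uint8)
recon = dq.reshape(features.shape).astype(np.float32) * scale + fmin

# "Near-lossless": the error is bounded by half a quantization step.
max_err = float(np.abs(recon - features).max())
assert max_err <= scale / 2 + 1e-6
```

The design choice this illustrates: only the quantizer introduces error, and that error is uniformly bounded, which is why moderate bit-depth reduction of deep features can leave inference accuracy essentially unchanged while the lossless coder shrinks the transmitted payload.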
“…In CI, the mobile device runs a part of the deep model between the input and some layer, generates a set of deep features, and sends them to the cloud for further processing by the remainder of the deep model, which resides in the cloud. In this context, the issues of deep feature compression [3,4,5] and transmission [6] become important. In [3], …”
Section: Introduction
confidence: 99%