Proceedings of the 24th Annual International Conference on Mobile Computing and Networking 2018
DOI: 10.1145/3241539.3241559
|View full text |Cite
|
Sign up to set email alerts
|

NestDNN

Abstract: Mobile vision systems such as smartphones, drones, and augmented-reality headsets are revolutionizing our lives. These systems usually run multiple applications concurrently and their available resources at runtime are dynamic due to events such as starting new applications, closing existing applications, and application priority changes. In this paper, we present NestDNN, a framework that takes the dynamics of runtime resources into account to enable resourceaware multi-tenant on-device deep learning for mobi… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
21
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
3
1

Relationship

1
7

Authors

Journals

citations
Cited by 192 publications
(25 citation statements)
references
References 27 publications
0
21
0
Order By: Relevance
“…For example, to adapt to the dynamically changing available resources of IoT devices, Han et al [27] proposed deploying multiple model variants on IoT devices. Fang et al [28] proposed making multiple model variants share parameters to save the limited storage resources of the IoT device. Additionally, to reduce the number of model parameters when the cloud server assists in training the IoT neural network model, many researchers proposed using model compression [29] or knowledge distillation [30] to reduce the amount of model parameter transmission.…”
Section: Cloud-assisted Approachmentioning
confidence: 99%
See 1 more Smart Citation
“…For example, to adapt to the dynamically changing available resources of IoT devices, Han et al [27] proposed deploying multiple model variants on IoT devices. Fang et al [28] proposed making multiple model variants share parameters to save the limited storage resources of the IoT device. Additionally, to reduce the number of model parameters when the cloud server assists in training the IoT neural network model, many researchers proposed using model compression [29] or knowledge distillation [30] to reduce the amount of model parameter transmission.…”
Section: Cloud-assisted Approachmentioning
confidence: 99%
“…Model compression techniques can further reduce the number of model parameters based on our method. The study of adapting IoT devices [27], [28] can also be built on our method.…”
Section: Cloud-assisted Approachmentioning
confidence: 99%
“…To mitigate/avoid these problems, [6], [7] suggested ondevice resource management. DeepMon [6] aims to guarantee continuous vision apps by optimizing the convolutional neural networks (CNN) on mobile GPUs.…”
Section: A Resource Management For Multiple Vision Appsmentioning
confidence: 99%
“…It accelerated the convolution by reusing the intermediate results via caching. NestDNN [7] proposed a filter pruning for resource management. However, both DeepMon and NestDNN support only non-real-time tasks, probably because the most commonly used real-time tasks (e.g., object detection or tracking) require significant amounts of computation that the edge system cannot complete in a timely manner (e.g., DeepMon shows only 1∼2 FPS).…”
Section: A Resource Management For Multiple Vision Appsmentioning
confidence: 99%
“…Targeting on-device deep learning, some researchers define multi-tenant as processing multiple computer vision applications for multiple concurrent tasks [77,78]. However, they focus on the multi-tenant on-device inference rather than training.…”
Section: Multi-tenancy Of Federated Learningmentioning
confidence: 99%