2020
DOI: 10.1109/jstsp.2020.2979669
|View full text |Cite
|
Sign up to set email alerts
|

Dual Dynamic Inference: Enabling More Efficient, Adaptive, and Controllable Deep Inference

Abstract: State-of-the-art convolutional neural networks (CNNs) yield record-breaking predictive performance, yet at the cost of high-energy-consumption inference, that prohibits their widely deployments in resource-constrained Internet of Things (IoT) applications. We propose a dual dynamic inference (DDI) framework that highlights the following aspects: 1) we integrate both input-dependent and resource-dependent dynamic inference mechanisms under a unified framework in order to fit the varying IoT resource requirement… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
36
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
5
2
2

Relationship

1
8

Authors

Journals

citations
Cited by 65 publications
(37 citation statements)
references
References 35 publications
0
36
0
Order By: Relevance
“…One issue that makes creating compact NNs for efficient inference challenging is the dependency on the characteristics of the target hardware. A framework for dynamic inference on resource constrained hardware, including input-and resource dependent dynamic inference mechanisms, allowing to meet specific resource constraints, has been proposed recently [25]. First steps are being made towards network compression methods that output representations prepared for later specialization to the target platform [26].…”
Section: Related Workmentioning
confidence: 99%
“…One issue that makes creating compact NNs for efficient inference challenging is the dependency on the characteristics of the target hardware. A framework for dynamic inference on resource constrained hardware, including input-and resource dependent dynamic inference mechanisms, allowing to meet specific resource constraints, has been proposed recently [25]. First steps are being made towards network compression methods that output representations prepared for later specialization to the target platform [26].…”
Section: Related Workmentioning
confidence: 99%
“…As a second case, when the application is designed using data-driven adaptive methodologies, such as [12,14,25,31,34], the CNN execution is sensitive to the input data complexity. To process "easy" images, they may use a lower resolution or fewer layers, whereas processing "hard" images requires more computation.…”
Section: Motivational Examplementioning
confidence: 99%
“…This may lead to increased UAV power consumption over the flight duration and, eventually, to the violation of the application power constraint, causing an emergency landing as illustrated in Figure 2(h). Thus, the methodologies in [12,14,25,31,34] are not suitable for CNN-based applications executed at the edge in changing environment, because these can neither properly adapt the application to the environment variations, nor guarantee that the application constantly meets platform-aware constraints.…”
Section: Motivational Examplementioning
confidence: 99%
“…FlexDNN [12] Vision/Classification Footprint overhead-aware design of EE-networks. DDI [76] Vision/Classification Combines layer/channel skipping with early exiting. MESS [40] Vision/Segmentation Image-level EE based on difficulty for semantic segmentation.…”
Section: Early-exiting Network-agnostic Techniquesmentioning
confidence: 99%