2021 IEEE 5th International Conference on Fog and Edge Computing (ICFEC) 2021
DOI: 10.1109/icfec51620.2021.00015
|View full text |Cite
|
Sign up to set email alerts
|

TOD: Transprecise Object Detection to Maximise Real-Time Accuracy on the Edge

Abstract: Real-time video analytics on the edge is challenging as the computationally constrained resources typically cannot analyse video streams at full fidelity and frame rate, which results in loss of accuracy. This paper proposes a Transprecise Object Detector (TOD) which maximises the real-time object detection accuracy on an edge device by selecting an appropriate Deep Neural Network (DNN) on the fly with negligible computational overhead. TOD makes two key contributions over the state of the art: (1) TOD leverag… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
21
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
2
2
1

Relationship

2
3

Authors

Journals

citations
Cited by 11 publications
(21 citation statements)
references
References 24 publications
0
21
0
Order By: Relevance
“…There has been a considerable body of work on exploring runtime adaptation of NNs according to given compute resource constraints [1,[25][26][27][28][29]. The single NestDNN NN model [1] switches between multiple capacities of the NN during runtime according to accuracy and inference latency requirements.…”
Section: Runtime Adaptation Of Nnsmentioning
confidence: 99%
See 1 more Smart Citation
“…There has been a considerable body of work on exploring runtime adaptation of NNs according to given compute resource constraints [1,[25][26][27][28][29]. The single NestDNN NN model [1] switches between multiple capacities of the NN during runtime according to accuracy and inference latency requirements.…”
Section: Runtime Adaptation Of Nnsmentioning
confidence: 99%
“…Similarly, Yu et al [29] proposes the Slimmable Neural Network, in which the filter parameters are shared from a smaller capacity model to increase the capacity of the NN. Another study [25] proposes to use a runtime decision mechanism to switch between multiple NNs dynamically, according to video content and computational latency, in order to improve the real-time object detection accuracy.…”
Section: Runtime Adaptation Of Nnsmentioning
confidence: 99%
“…As shown in [8], on average about 83% of runtime computations for many machine learning and deep learning applications can be approximated. This can lead to substantial savings in power consumption, for instance in [9], the authors saved 63% of the power consumption by using approximations. This adaptive approximation can be performed on the hardware, e.g., using Dynamic Partial Reconfiguration (DPR) and by instantiating of different arithmetic hardware units [10], or using dynamic change of the frequency of operations [11].…”
Section: Energy-aware Approximate Deep Learningmentioning
confidence: 99%
“…This adaptive approximation can be performed on the hardware, e.g., using Dynamic Partial Reconfiguration (DPR) and by instantiating of different arithmetic hardware units [10], or using dynamic change of the frequency of operations [11]. It could also happen at the software level, e.g., by changing the utilized machine learning algorithm based on input data, such as [9] and [12], or by changing the memory access policy used in [13]. The benefits of these approaches can be up to 2.5× savings in the energy consumption of the system.…”
Section: Energy-aware Approximate Deep Learningmentioning
confidence: 99%
“…Consider the example of a real-time video analytics application, such as identifying objects on different frames of a video stream. A different DNN model from a portfolio of models can be employed for each frame to maximise the accuracy of detection [LVWV21]. This is achieved by leveraging the meta-characteristics of each video frame, such as the size of the object and the speed of movement of the object.…”
Section: The Effect Of Workload Patternsmentioning
confidence: 99%