Self-supervised Knowledge Distillation Using Singular Value Decomposition

Lee, Seung Hyun; Kim, Daeha; Song, Byung Cheol

doi:10.48550/arxiv.1807.06819

Cited by 1 publication

(1 citation statement)

References 24 publications

“…The feature-based knowledge type aims to calculate the distillation loss from the intermediate representations of the teacher and student models [39, 40, 41, 42, 43, 44]. The relation-based knowledge type aims to utilize the relation from the feature maps [45, 46, 47, 48]. Although it has a similarity with the previous feature-based KT in the perspective of using the intermediate feature map, it is distinguished from using the manipulated function of the feature maps such as the Gram matrix [45].…”

Section: System Model (mentioning)

confidence: 99%

A Method of Deep Learning Model Optimization for Image Classification on Edge Device

Lee

Lee³

2022

Sensors

Due to the recent increasing utilization of deep learning models on edge devices, the industry demand for Deep Learning Model Optimization (DLMO) is also increasing. This paper derives a usage strategy of DLMO based on the performance evaluation through light convolution, quantization, pruning techniques and knowledge distillation, known to be excellent in reducing memory size and operation delay with a minimal accuracy drop. Through experiments regarding image classification, we derive possible and optimal strategies to apply deep learning into Internet of Things (IoT) or tiny embedded devices. In particular, strategies for DLMO technology most suitable for each on-device Artificial Intelligence (AI) service are proposed in terms of performance factors. In this paper, we suggest a possible solution of the most rational algorithm under very limited resource environments by utilizing mature deep learning methodologies.