ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
DOI: 10.1109/icassp39728.2021.9414619

Two-Stage Textual Knowledge Distillation for End-to-End Spoken Language Understanding

Abstract: End-to-end approaches open a new way for more accurate and efficient spoken language understanding (SLU) systems by alleviating the drawbacks of traditional pipeline systems. Previous works exploit textual information for an SLU model via pre-training with automatic speech recognition or fine-tuning with knowledge distillation. To utilize textual information more effectively, this work proposes a two-stage textual knowledge distillation method that matches utterance-level representations and predicted logits of …
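The abstract only sketches the method, so below is a minimal, hypothetical illustration of what a two-stage distillation objective of this kind could look like in PyTorch: one stage matching utterance-level representations (here with an MSE loss) and one stage matching predicted logits (here with a temperature-scaled KL divergence). The function names, loss choices, and tensor shapes are assumptions for illustration, not the authors' implementation.

# Hypothetical sketch of a two-stage textual knowledge distillation objective.
# Loss choices and names are illustrative assumptions, not the paper's code.
import torch
import torch.nn.functional as F

def representation_distillation_loss(speech_repr, text_repr):
    # Stage 1: match utterance-level representations of the speech model
    # (student) and the text model (teacher), e.g. with an MSE loss.
    return F.mse_loss(speech_repr, text_repr.detach())

def logit_distillation_loss(student_logits, teacher_logits, temperature=2.0):
    # Stage 2: match predicted logits via temperature-scaled KL divergence.
    log_p_student = F.log_softmax(student_logits / temperature, dim=-1)
    p_teacher = F.softmax(teacher_logits.detach() / temperature, dim=-1)
    return F.kl_div(log_p_student, p_teacher, reduction="batchmean") * temperature ** 2

# Example usage with random tensors standing in for encoder outputs and
# intent-classifier logits (shapes are assumptions).
speech_repr = torch.randn(8, 768)     # student utterance-level representations
text_repr = torch.randn(8, 768)       # teacher (text model) representations
student_logits = torch.randn(8, 31)   # e.g. 31 intent classes
teacher_logits = torch.randn(8, 31)

stage1_loss = representation_distillation_loss(speech_repr, text_repr)
stage2_loss = logit_distillation_loss(student_logits, teacher_logits)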

Cited by 5 publications (1 citation statement)
References 18 publications
“…By simultaneously optimizing text and image generation models, they improved the quality and consistency of image generation. Kim et al. introduced a knowledge distillation method from speech to text, named Speech2Text Distillation [35], leveraging pretrained speech recognition models to enhance text generation models. They significantly improved the performance of speech-to-text tasks through cross-modal distillation.…”
Section: Cross-modal Distillation
Citation type: mentioning
confidence: 99%