ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
DOI: 10.1109/icassp39728.2021.9414518
Making Punctuation Restoration Robust and Fast with Multi-Task Learning and Knowledge Distillation

Abstract: In punctuation restoration, we try to recover the missing punctuation from automatic speech recognition output to improve understandability. Currently, large pre-trained transformers such as BERT set the benchmark on this task, but there are two main drawbacks to these models. First, the pre-training data does not match the output data from speech recognition, which contains errors. Second, the large number of model parameters increases inference time. To address the former, we use a multi-task learning framework…
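As a rough illustration of the knowledge-distillation component the abstract alludes to, the sketch below shows a generic soft-label distillation loss for token-level punctuation tagging in PyTorch. It is a minimal sketch under stated assumptions: the function name, the temperature T, and the weight alpha are illustrative choices, not the paper's exact formulation.

```python
import torch
import torch.nn.functional as F

def punctuation_distillation_loss(student_logits, teacher_logits, labels,
                                  T=2.0, alpha=0.5):
    """Hypothetical KD loss for token-level punctuation tagging.

    student_logits, teacher_logits: (batch, seq_len, num_classes)
    labels: (batch, seq_len) gold punctuation class ids
    """
    num_classes = student_logits.size(-1)
    s = student_logits.view(-1, num_classes)
    t = teacher_logits.view(-1, num_classes)

    # Soft-target term: KL divergence between temperature-softened teacher
    # and student distributions, scaled by T^2 (standard KD practice).
    kd = F.kl_div(F.log_softmax(s / T, dim=-1),
                  F.softmax(t / T, dim=-1),
                  reduction="batchmean") * (T * T)

    # Hard-label term: ordinary cross-entropy on the gold punctuation labels.
    ce = F.cross_entropy(s, labels.view(-1))

    return alpha * kd + (1.0 - alpha) * ce
```

In this kind of setup, the large BERT-style teacher provides the soft targets while a smaller, faster student is trained on the combined loss, trading a little accuracy for lower inference time.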

Citations: Cited by 11 publications (2 citation statements)
References: 15 publications
“…Currently, knowledge distillation [23], a popular technology for model enhancement, has been widely used in computer vision, natural language processing, and automatic speech recognition [24,25]. Inspired by its broad applicability, this paper incorporates knowledge distillation into GCN-based recommendation models to alleviate the above limitations. The proposed idea differs from existing work in that we propose a two-phase knowledge distillation model (TKDM) to improve the effectiveness of GCN-based recommendations.…”
Section: Introduction
Confidence: 99%