2022
DOI: 10.1007/s10489-022-03678-y

Reminding the incremental language model via data-free self-distillation

citations
Cited by 2 publications
(3 citation statements)
references
References 24 publications
“…In this section, we analyze whether the improvement of Metac-Adapt comes from generating better pseudo samples. We apply the CPS [8] to evaluate the quality of pseudo samples. The CPS is a BLEU-based method that calculates sample-averaged BLEU scores between the pseudo samples and the training dataset of each learned task to obtain the distribution of knowledge of learned tasks, and then calculates the Jensen-Shannon divergence between the BLEU distribution and the uniform distribution.…”
Section: The Quality Of Pseudo Samples (mentioning)
confidence: 99%
“…The CPS is a BLEU-based method that calculates sample-averaged BLEU scores between the pseudo samples and the training dataset of each learned task to obtain the distribution of knowledge of learned tasks, and then calculates the Jensen-Shannon divergence between the BLEU distribution and the uniform distribution. As described in [8], the lower the value of CPS represents better pseudo samples, which are beneficial to prevent catastrophic forgetting. CPS-n represents the quality of pseudo-data generated after learning n tasks.…”
Section: The Quality Of Pseudo Samples (mentioning)
confidence: 99%
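
The CPS computation described in the statements above can be sketched roughly as follows. This is an illustrative approximation, not the cited paper's implementation: a clipped unigram precision stands in for full BLEU, the per-task scores are normalized to sum to 1 (an assumption about how the "distribution of knowledge" is formed), and the names `ngram_precision`, `js_divergence`, and `cps_sketch` are hypothetical.

```python
import math
from collections import Counter


def ngram_precision(candidate, references, n=1):
    """Clipped n-gram precision of one candidate against a set of references.

    Assumption: a simplified stand-in for BLEU (no brevity penalty,
    a single n-gram order).
    """
    tokens = candidate.split()
    cand = Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))
    if not cand:
        return 0.0
    # Maximum count of each n-gram over all references (for clipping).
    max_ref = Counter()
    for ref in references:
        toks = ref.split()
        ref_counts = Counter(tuple(toks[i:i + n]) for i in range(len(toks) - n + 1))
        for gram, count in ref_counts.items():
            max_ref[gram] = max(max_ref[gram], count)
    clipped = sum(min(count, max_ref[gram]) for gram, count in cand.items())
    return clipped / sum(cand.values())


def js_divergence(p, q):
    """Jensen-Shannon divergence (base 2) between two discrete distributions."""
    def kl(a, b):
        return sum(x * math.log2(x / y) for x, y in zip(a, b) if x > 0)

    m = [(x + y) / 2 for x, y in zip(p, q)]
    return 0.5 * kl(p, m) + 0.5 * kl(q, m)


def cps_sketch(pseudo_samples, task_datasets):
    """Lower is better: pseudo samples cover the learned tasks more evenly."""
    # Sample-averaged score of the pseudo samples against each task's data.
    scores = [
        sum(ngram_precision(s, refs) for s in pseudo_samples) / len(pseudo_samples)
        for refs in task_datasets
    ]
    total = sum(scores)
    if total == 0:
        raise ValueError("pseudo samples share no n-grams with any task dataset")
    # Assumption: normalize the scores into a distribution over tasks.
    distribution = [s / total for s in scores]
    uniform = [1.0 / len(scores)] * len(scores)
    return js_divergence(distribution, uniform)


task_a = ["the cat sat"]
task_b = ["a dog ran"]
balanced = cps_sketch(["the cat sat", "a dog ran"], [task_a, task_b])
skewed = cps_sketch(["the cat sat", "the cat sat"], [task_a, task_b])
```

In this toy setup, pseudo samples that cover both tasks equally give a lower value than samples drawn from one task only, which matches the "lower is better" reading in the citation statement.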