2022
DOI: 10.48550/arxiv.2205.11603
Preprint

Improving language models fine-tuning with representation consistency targets

Cited by 1 publication (1 citation statement)
References 0 publications
“…Additionally, for many applications such storage would not be feasible due to privacy settings, when access to past data is not available. Regularization-based approaches are more memory-efficient than replay-based approaches, but suffer from catastrophic forgetting and are often not suitable for long task sequences (Kirkpatrick et al., 2017; Razdaibiedina et al., 2022). In contrast to regularization-based and replay-based approaches, architectural CL approaches are more efficient in resolving catastrophic forgetting and, hence, are suitable for long sequences of tasks.…”
Section: Forward Transfer Experiments
Mentioning confidence: 99%
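The quoted statement contrasts regularization-based continual learning (e.g., elastic weight consolidation, Kirkpatrick et al., 2017) with replay-based and architectural approaches, noting that the regularization family is memory-efficient but prone to forgetting. As a rough illustration of where that memory efficiency comes from, below is a minimal sketch of an EWC-style quadratic penalty; the function name, argument names, and the diagonal-Fisher approximation are illustrative assumptions, not code from the cited papers.

```python
import torch

def ewc_penalty(model, old_params, fisher_diag, lam=1.0):
    """Illustrative EWC-style quadratic penalty (hypothetical helper).

    Only one parameter snapshot (`old_params`) and one importance
    estimate (`fisher_diag`) per task are stored, rather than raw past
    examples, which is why regularization-based CL is memory-efficient
    compared to replay. Both arguments are dicts mapping parameter
    names to tensors saved after training on the previous task.
    """
    penalty = torch.zeros(())
    for name, p in model.named_parameters():
        # Penalize drift away from the previous task's optimum,
        # weighted by each weight's estimated importance (diagonal Fisher).
        penalty = penalty + (fisher_diag[name] * (p - old_params[name]) ** 2).sum()
    return lam * penalty

# Hypothetical usage while fine-tuning on the next task:
# loss = task_loss(model, batch) + ewc_penalty(model, old_params, fisher_diag, lam=0.4)
```

Because the penalty only discourages movement of previously important weights, long task sequences accumulate constraints and the trade-off between plasticity and retention degrades, which is the catastrophic-forgetting limitation the quoted statement attributes to this family of methods.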