2022
DOI: 10.1016/j.ins.2022.07.085
|View full text |Cite
|
Sign up to set email alerts
|

Target layer regularization for continual learning using Cramer-Wold distance

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2025
2025

Publication Types

Select...
5

Relationship

0
5

Authors

Journals

citations
Cited by 6 publications
(1 citation statement)
references
References 7 publications
0
1
0
Order By: Relevance
“…For example, the elastic weight integration (EWC) [36] measures the sensitivity of the parameters relative to each task through the Fisher information matrix of the parameters calculated by KL divergence and indicates which parameters need to be retained most to avoid forgetting old tasks. Mazur et al [48] proposed the idea of CW-TaLaR, similar to EWC, using the Cremer-Will distance (instead of the KL divergence) to calculate the penalty term directly. Synaptic intelligence (SI) [37] calculates the path integral of the motion trajectory of parameters in the training process and takes the absolute integral as an indicator to measure the importance of parameters, which can be represented by the local contribution of each parameter to the overall loss change in the training process.…”
Section: Continuous Learning Methods Based On Parameter Importance Ca...mentioning
confidence: 99%
“…For example, the elastic weight integration (EWC) [36] measures the sensitivity of the parameters relative to each task through the Fisher information matrix of the parameters calculated by KL divergence and indicates which parameters need to be retained most to avoid forgetting old tasks. Mazur et al [48] proposed the idea of CW-TaLaR, similar to EWC, using the Cremer-Will distance (instead of the KL divergence) to calculate the penalty term directly. Synaptic intelligence (SI) [37] calculates the path integral of the motion trajectory of parameters in the training process and takes the absolute integral as an indicator to measure the importance of parameters, which can be represented by the local contribution of each parameter to the overall loss change in the training process.…”
Section: Continuous Learning Methods Based On Parameter Importance Ca...mentioning
confidence: 99%