2019
DOI: 10.48550/arxiv.1902.09434
Preprint

S-TRIGGER: Continual State Representation Learning via Self-Triggered Generative Replay

Abstract: We consider the problem of building a state representation model for control, in a continual learning setting. As the environment changes, the aim is to efficiently compress the sensory state's information without losing past knowledge, and then use Reinforcement Learning on the resulting features for efficient policy learning. To this end, we propose S-TRIGGER, a general method for Continual State Representation Learning applicable to Variational Auto-Encoders and their many variants. The method is based on Gen…
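The abstract outlines a two-stage pipeline: a Variational Auto-Encoder compresses raw observations into a compact state representation, and a Reinforcement Learning agent is then trained on those features. Below is a minimal sketch of that general pipeline, not the paper's architecture; the module names, layer sizes, and the discrete-action policy head are all illustrative assumptions.

```python
# Illustrative sketch only: a VAE encoder compresses observations to latent features,
# and an RL policy acts on those features. Names and sizes are assumptions.
import torch
import torch.nn as nn

class VAEEncoder(nn.Module):
    """Maps a flattened observation to the mean and log-variance of a latent Gaussian."""
    def __init__(self, obs_dim: int, latent_dim: int):
        super().__init__()
        self.backbone = nn.Sequential(nn.Linear(obs_dim, 256), nn.ReLU())
        self.mu = nn.Linear(256, latent_dim)
        self.logvar = nn.Linear(256, latent_dim)

    def forward(self, obs):
        h = self.backbone(obs)
        return self.mu(h), self.logvar(h)

class Policy(nn.Module):
    """Simple policy head operating on the learned state representation."""
    def __init__(self, latent_dim: int, n_actions: int):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(latent_dim, 64), nn.ReLU(),
                                 nn.Linear(64, n_actions))

    def forward(self, z):
        return torch.distributions.Categorical(logits=self.net(z))

encoder = VAEEncoder(obs_dim=64 * 64 * 3, latent_dim=32)
policy = Policy(latent_dim=32, n_actions=4)

obs = torch.rand(1, 64 * 64 * 3)                                   # dummy flattened observation
mu, logvar = encoder(obs)
z = mu + torch.randn_like(mu) * torch.exp(0.5 * logvar)            # reparameterisation trick
action = policy(z).sample()                                        # the agent acts on compressed features
```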

Cited by 3 publications (3 citation statements)
References 18 publications
“…Specifically, the Exemplar-Supported Generative Reproduction model (He et al., 2018) uses a GAN to generate pseudo-examples for replay during continual learning, while the Dynamic Generative Memory model (Ostapenko et al., 2019), the Deep Generative Replay model (Shin et al., 2017), the Memory Replay GAN model (Wu et al., 2018), and the Closed-Loop GAN model (Rios and Itti, 2018) are all used to continually learn to generate images and scenes. Continual learning with replay in GANs has also been used for reinforcement learning (Caselles-Dupré et al., 2019). Moreover, unsupervised learning techniques such as auto-encoders and GANs are widely used to generate replay samples in supervised learning algorithms (Draelos et al., 2017; …).…”
Section: Replay In Unsupervised Learning (mentioning)
confidence: 99%
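The excerpt above describes generative replay: a generative model produces pseudo-examples of past data, which are mixed with data from the current environment so that continual training does not erase earlier knowledge. The following is a minimal, hedged sketch of such a training loop; `generator.sample`, `model.loss`, and the replay ratio are illustrative assumptions, not the interface of any of the cited models.

```python
# Illustrative generative-replay loop (not any specific cited model): samples drawn
# from a frozen snapshot of the previous generator are mixed with new data.
import copy
import torch

def train_with_generative_replay(model, generator, new_data_loader, optimizer,
                                 replay_ratio=0.5):
    old_generator = copy.deepcopy(generator).eval()   # frozen snapshot of past knowledge
    for real_batch in new_data_loader:
        n_replay = int(real_batch.shape[0] * replay_ratio)
        with torch.no_grad():
            # pseudo-examples standing in for data from earlier environments (assumed API)
            replay_batch = old_generator.sample(n_replay)
        batch = torch.cat([real_batch, replay_batch], dim=0)
        loss = model.loss(batch)                       # e.g. VAE reconstruction + KL, or a GAN step
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
```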
“…Furthermore, the input items RePR generates are not static images but rather a sequence of consecutive frames. Since our work, pseudo-rehearsal has been used to overcome CF in models which have learnt to generate states from previously seen environments [39], [40]. In both these cases, pseudo-rehearsal was not applied to the learning agent to prevent its CF.…”
Section: Related Work (mentioning)
confidence: 99%
“…training procedure is presented in Algorithm 1 in Appendix C.2. For each image in a batch, we compute $f(o_t) = z_t$ and $f(o_{t+1}) = z_{t+1}$ using the encoder part of the VAE. Then we decode $z_t$ with the decoder and compute the reconstruction loss $L_{reconstruction}$ and the annealed KL divergence $L_{KL}$ as in (Caselles-Dupré et al., 2019). Then we compute $\hat{A}(a_t) \cdot z_t$ and the forward loss, which is the MSE with $z_{t+1}$: $L_{forward} = (\hat{A}(a_t) \cdot z_t - z_{t+1})^2$.…”
mentioning
confidence: 99%
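The last excerpt spells out a concrete loss: encode $o_t$ and $o_{t+1}$ with the VAE encoder, decode $z_t$ to obtain the reconstruction and annealed KL terms, and regress $\hat{A}(a_t) \cdot z_t$ onto $z_{t+1}$ with an MSE forward loss. The sketch below follows that description; the per-action matrix parameterisation of $\hat{A}$, the use of the posterior mean as the target $z_{t+1}$, and all module names are assumptions rather than the authors' implementation.

```python
# Illustrative sketch of the described loss; module names and the form of A_hat
# (one learned matrix per discrete action) are assumptions.
import torch
import torch.nn as nn
import torch.nn.functional as F

class ActionLinear(nn.Module):
    """A_hat(a_t): a learned linear transform of the latent state, one matrix per discrete action."""
    def __init__(self, n_actions: int, latent_dim: int):
        super().__init__()
        self.A = nn.Parameter(torch.stack([torch.eye(latent_dim) for _ in range(n_actions)]))

    def forward(self, z_t, a_t):
        # Batched matrix-vector product: A_hat(a_t) . z_t
        return torch.einsum('bij,bj->bi', self.A[a_t], z_t)

def loss_step(encoder, decoder, action_linear, o_t, o_t1, a_t, kl_weight):
    mu_t, logvar_t = encoder(o_t)
    z_t = mu_t + torch.randn_like(mu_t) * torch.exp(0.5 * logvar_t)   # f(o_t) = z_t
    with torch.no_grad():
        mu_t1, _ = encoder(o_t1)                                      # f(o_{t+1}) = z_{t+1} (mean used as target)
    recon = decoder(z_t)
    l_reconstruction = F.mse_loss(recon, o_t)
    l_kl = -0.5 * torch.mean(1 + logvar_t - mu_t.pow(2) - logvar_t.exp())
    z_t1_pred = action_linear(z_t, a_t)                               # A_hat(a_t) . z_t
    l_forward = F.mse_loss(z_t1_pred, mu_t1)                          # (A_hat(a_t) . z_t - z_{t+1})^2
    return l_reconstruction + kl_weight * l_kl + l_forward            # kl_weight annealed over training
```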