2017
DOI: 10.1007/978-3-319-59394-4_18

Pseudorehearsal in Value Function Approximation

Abstract: Catastrophic forgetting is of special importance in reinforcement learning, as the data distribution is generally non-stationary over time. We study and compare several pseudorehearsal approaches for Q-learning with function approximation in a pole balancing task. We have found that pseudorehearsal appears to assist learning even in such a simple problem, given proper initialization of the rehearsal parameters.
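
As a concrete illustration of the technique the abstract describes, here is a minimal sketch: pseudo-items are random inputs labelled with the current network's own outputs, and rehearsing them alongside the TD update resists catastrophic forgetting. This is not the paper's implementation; the network size, the pseudo-item distribution, the mixing weight `BETA`, and the helper names (`make_pseudo_items`, `q_update`) are illustrative assumptions.

```python
# Hedged sketch of pseudorehearsal for Q-learning with a neural
# function approximator. All sizes, rates, and names are illustrative.
import torch
import torch.nn as nn

STATE_DIM, N_ACTIONS = 4, 2          # e.g. pole balancing: 4 state variables, 2 actions
GAMMA, BETA, N_PSEUDO = 0.99, 1.0, 32

q_net = nn.Sequential(nn.Linear(STATE_DIM, 32), nn.Tanh(), nn.Linear(32, N_ACTIONS))
opt = torch.optim.SGD(q_net.parameters(), lr=1e-2)

def make_pseudo_items(net, n):
    """Sample random inputs and record the *current* network's outputs.

    Rehearsing these (input, output) pairs later pulls the network back
    toward its old input-output mapping, which is the core of
    pseudorehearsal (no stored real experience is needed)."""
    with torch.no_grad():
        x = torch.rand(n, STATE_DIM) * 2 - 1   # assumed input range [-1, 1]
        y = net(x).clone()
    return x, y

def q_update(s, a, r, s_next, done, pseudo_x, pseudo_y):
    # Standard one-step Q-learning target.
    with torch.no_grad():
        target = r + (0.0 if done else GAMMA * q_net(s_next).max().item())
    td_loss = (q_net(s)[a] - target) ** 2
    # Pseudorehearsal term: keep outputs on the pseudo-items close to
    # the values they had when the items were generated.
    rehearsal_loss = ((q_net(pseudo_x) - pseudo_y) ** 2).mean()
    loss = td_loss + BETA * rehearsal_loss
    opt.zero_grad()
    loss.backward()
    opt.step()
```

In practice the pseudo-item buffer would be refreshed periodically, since stale pseudo-targets anchor the network to an outdated value function; how often to refresh is one of the rehearsal parameters the paper's initialization caveat refers to.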

Cited by 2 publications (2 citation statements) | References 23 publications
“…PR is a simple and computationally efficient method for solving the catastrophic forgetting (CF) problem, which has proven successful in unsupervised learning [17], supervised learning [21], [16], and sometimes in reinforcement learning as well [22], [14], [23]. It is interesting to note that Baddeley's results suggest that the widely studied ill conditioning might not be the main bottleneck of reinforcement learning, while CF may be.…”
Section: Pseudorehearsal (mentioning, confidence: 99%)
“…We have shown that pseudorehearsal can significantly improve performance in Q-learning algorithms [1], and now want to test it on the more interesting and complex actor-critic algorithm. Actor-critic methods are a class of reinforcement learning algorithms based on TD-learning.…”
Section: Introduction (mentioning, confidence: 99%)
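
For context, the actor-critic update the citing work refers to can be sketched as follows. This is a generic one-step formulation, not that paper's implementation; the network sizes, learning rates, and function name (`actor_critic_step`) are illustrative assumptions.

```python
# Hedged sketch of a one-step actor-critic TD update.
import torch
import torch.nn as nn

STATE_DIM, N_ACTIONS, GAMMA = 4, 2, 0.99
actor = nn.Sequential(nn.Linear(STATE_DIM, 32), nn.Tanh(),
                      nn.Linear(32, N_ACTIONS), nn.Softmax(dim=-1))
critic = nn.Sequential(nn.Linear(STATE_DIM, 32), nn.Tanh(), nn.Linear(32, 1))
a_opt = torch.optim.SGD(actor.parameters(), lr=1e-3)
c_opt = torch.optim.SGD(critic.parameters(), lr=1e-2)

def actor_critic_step(s, a, r, s_next, done):
    # TD error from the critic's value estimates drives both updates.
    with torch.no_grad():
        td_target = r + (0.0 if done else GAMMA * critic(s_next).item())
    value = critic(s)
    td_error = td_target - value.item()
    # Critic: regress V(s) toward the one-step TD target.
    c_loss = (td_target - value) ** 2
    c_opt.zero_grad()
    c_loss.backward()
    c_opt.step()
    # Actor: policy-gradient step weighted by the TD error.
    logp = torch.log(actor(s)[a])
    a_loss = -td_error * logp
    a_opt.zero_grad()
    a_loss.backward()
    a_opt.step()
```

Pseudorehearsal would extend this by adding a rehearsal penalty (as in the Q-learning sketch above) to the critic loss, the actor loss, or both; which combination works best is presumably what the citing work sets out to test.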