2020
DOI: 10.48550/arxiv.2007.03437
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Group Equivariant Deep Reinforcement Learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
5
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(5 citation statements)
references
References 0 publications
0
5
0
Order By: Relevance
“…However, we restrict ourselves to a model-free approach where the model-based machinery presented in [ 18 ] does not apply. Finally, we mention that there is some literature on exploiting equivalence in deep RL (e.g., [ 21 , 22 ]). However, none of these works study provably efficient learning methods to our best knowledge.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…However, we restrict ourselves to a model-free approach where the model-based machinery presented in [ 18 ] does not apply. Finally, we mention that there is some literature on exploiting equivalence in deep RL (e.g., [ 21 , 22 ]). However, none of these works study provably efficient learning methods to our best knowledge.…”
Section: Related Workmentioning
confidence: 99%
“…As a result, the learning performance would depend on the effective size of the state space (or the effective number of unknown parameters). Various notions of structures have been studied in MDPs, which include the Lipschitz continuity of MDP parameters (e.g., rewards and transition functions) [ 10 , 11 , 12 , 13 ], factorization structure [ 14 , 15 , 16 ], and equivalence relations [ 17 , 18 , 19 , 20 , 21 , 22 ]. These works reveal that exploiting the underlying structure in the environment in various RL tasks leads to massive empirical performance gain (over structure-oblivious algorithms) and to significantly improved performance bounds.…”
Section: Introductionmentioning
confidence: 99%
“…The efficacy of equivariant deep learning has been explored in the field of medical imaging [21], object detection [22], aircraft detection [23] and reinforcement learning [24]. To the best of our knowledge, this concept has not been applied to feature description to obtain descriptors that are rotation equivariant.…”
Section: B Rotation Equivariance Networkmentioning
confidence: 99%
“…In reinforcement learning, some recent work applies equivariant models to structure-finding problems involving MDP homomorphisms [15,16]. In addition, Mondal et al [17] recently applied an E(2)-equivariant model to Q learning in an Atari game domain, but showed limited improvement. To our knowledge, equivariant model architectures have not been explored in the context of robotics applications.…”
Section: Related Workmentioning
confidence: 99%