2021 International Conference on Information and Communication Technology Convergence (ICTC) 2021
DOI: 10.1109/ictc52510.2021.9620885
|View full text |Cite
|
Sign up to set email alerts
|

Introduction to Quantum Reinforcement Learning: Theory and PennyLane-based Implementation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
20
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
3
2

Relationship

1
9

Authors

Journals

citations
Cited by 35 publications
(20 citation statements)
references
References 9 publications
0
20
0
Order By: Relevance
“…This should be doable, as one knows the state preparation routine Summary. The paper by Kwak et al [Kwa+21] gives a short introduction to both RL and (variational) QC. This is followed up by a tutorial on how to implement a VQC-enhanced RL algorithm with PennyLane to solve the CartPole environment.…”
Section: Algorithmicmentioning
confidence: 99%
“…This should be doable, as one knows the state preparation routine Summary. The paper by Kwak et al [Kwa+21] gives a short introduction to both RL and (variational) QC. This is followed up by a tutorial on how to implement a VQC-enhanced RL algorithm with PennyLane to solve the CartPole environment.…”
Section: Algorithmicmentioning
confidence: 99%
“…3) Control: Quantum reinforcement learning (QRL) has been actively studied. [53], [54] have proposed the hybrid computing methods, i.e., the controller policy is based on VQC, and the evaluator-side network (i.e., critic) is based on a classical neural network. In addition, [55], [56] have proposed an utterly quantum version of the reinforcement learning regime.…”
Section: Autonomous Mobilitymentioning
confidence: 99%
“…Some works use classical-quantum hybrid model to solve large problems [23,24]. Other works use variational quantum circuit to represent policy in reinforcement learning [25,26]. Variational quantum circuit is also applied to represent both policy and value function in actor-critic method [27].…”
Section: Introductionmentioning
confidence: 99%