2022
DOI: 10.22331/q-2022-05-24-720
|View full text |Cite
|
Sign up to set email alerts
|

Quantum agents in the Gym: a variational quantum algorithm for deep Q-learning

Abstract: Quantum machine learning (QML) has been identified as one of the key fields that could reap advantages from near-term quantum devices, next to optimization and quantum chemistry. Research in this area has focused primarily on variational quantum algorithms (VQAs), and several proposals to enhance supervised, unsupervised and reinforcement learning (RL) algorithms with VQAs have been put forward. Out of the three, RL is the least studied and it is still an open question whether VQAs can be competitive with stat… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
33
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
2
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 74 publications
(47 citation statements)
references
References 54 publications
0
33
0
Order By: Relevance
“…3) Hybrid-quantum PPO: Inspired by the quantum advantage shown in Refs. [10]- [12], [24], [25], this section investigates the utility of a hybrid-quantum neural network as a policy model for RL. Fig.…”
Section: Reinforcement Learningmentioning
confidence: 99%
“…3) Hybrid-quantum PPO: Inspired by the quantum advantage shown in Refs. [10]- [12], [24], [25], this section investigates the utility of a hybrid-quantum neural network as a policy model for RL. Fig.…”
Section: Reinforcement Learningmentioning
confidence: 99%
“…The best possible action solution can be estimated to maximize objective reward through learning. --- [16], [17] -- [23], [24] --this work during a finite flight time. In [11], a deep Q-network (DQN) was designed to optimize UAV navigation using a massive multiple-input-multiple-output (MIMO) technique by selecting the optimal policy.…”
Section: Introductionmentioning
confidence: 99%
“…In [23], [24], quantum reinforcement learning outperformed classical reinforcement learning by obtaining higher rewards. In [23], classical deep reinforcement learning algorithms, such as experience replay and target networks, were transformed into a representation of variational quantum circuits.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…The core idea is exploiting PQCs to delegate classically difficult computation and variationally updating the parameters in PQCs with a powerful classical optimizer [28,29,30,31,32]. With error mitigation techniques [33,34,35,36,37,38,39,40], VQAs have been applied to various fields and the corresponding specific algorithms have been developed, including variational quantum eigensolver (VQE) [41,42,43,44,45], quantum approximate optimization algorithm [46,47,48,15,49], variational quantum simulator [50,51,52,53], quantum neural network (QNN) [54,55,56,57,58,59,60,61,62,63], and others [64,65,66,67,68,69,70,71,72,...…”
Section: Introduction 1backgroundmentioning
confidence: 99%