2021
DOI: 10.48550/arxiv.2101.07107
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Deep Reinforcement Learning for Active High Frequency Trading

Abstract: We introduce the first end-to-end Deep Reinforcement Learning (DRL) based framework for active high frequency trading. We train DRL agents to trade one unit of Intel Corporation stock by employing the Proximal Policy Optimization algorithm. The training is performed on three contiguous months of high frequency Limit Order Book data, of which the last month constitutes the validation data. In order to maximise the signal to noise ratio in the training data, we compose the latter by only selecting training sampl… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
11
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
3
2

Relationship

1
8

Authors

Journals

citations
Cited by 12 publications
(15 citation statements)
references
References 29 publications
(34 reference statements)
0
11
0
Order By: Relevance
“…In subsequent continuation of this work, we can apply MBO data to various financial applications including market-making or trade execution. Further, the work of Briola et al (2021) applies Reinforcement Learning (RL) algorithms to high-frequency trading, and it would be interesting to test the effectiveness of using MBO data within a RL framework.…”
Section: Appendix a Complete Search Space For Hyperparametersmentioning
confidence: 99%
“…In subsequent continuation of this work, we can apply MBO data to various financial applications including market-making or trade execution. Further, the work of Briola et al (2021) applies Reinforcement Learning (RL) algorithms to high-frequency trading, and it would be interesting to test the effectiveness of using MBO data within a RL framework.…”
Section: Appendix a Complete Search Space For Hyperparametersmentioning
confidence: 99%
“…The main source of complexity comes from the intricate interaction of heterogeneous actors following various strategies designed to impact at different time scales. They are highly stochastic environments with a low signal to noise ratio, dominated by strong non-stationary dynamics and characterized by feedback loops and non-linear effects [2,3]. Despite their complexity, financial systems are governed by rather a stable and partially identified framework of rules [4].…”
Section: Introductionmentioning
confidence: 99%
“…The main source of complexity comes from the intricate interaction of heterogeneous actors following various strategies designed to impact at different time scales. They are highly stochastic environments with a low signal to noise ratio, dominated by strong non-stationary dynamics and characterized by feedback loops and non-linear effects [ 2 , 3 , 4 ]. Despite their complexity, financial systems are governed by a rather stable and partially identified framework of rules [ 5 ].…”
Section: Introductionmentioning
confidence: 99%