2022
DOI: 10.3390/stats5020033

Quantitative Trading through Random Perturbation Q-Network with Nonlinear Transaction Costs

Abstract: In recent years, reinforcement learning (RL) has seen increasing applications in the financial industry, especially in quantitative trading and portfolio optimization, where the focus is on long-term reward rather than short-term profit. Sequential decision making and Markov decision processes are well suited to this type of application. Through trial and error on historical data, an agent can learn the characteristics of the market and evolve an algorithm to maximize the cumulative returns. In this…
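The trial-and-error loop the abstract describes can be illustrated with tabular Q-learning on a toy MDP. This is a sketch only, not the paper's network-based method; the two-state reward table and all hyperparameters are made-up assumptions:

```python
import numpy as np

# Toy illustration: an agent improves its policy by trial and error,
# maximizing cumulative discounted reward via the Q-learning update.
rng = np.random.default_rng(0)
n_states, n_actions = 2, 2
Q = np.zeros((n_states, n_actions))
alpha, gamma, eps = 0.1, 0.9, 0.1

# Hypothetical reward table: action 1 pays in both states.
R = np.array([[0.0, 1.0],
              [0.0, 2.0]])

s = 0
for _ in range(5000):
    # epsilon-greedy exploration
    a = rng.integers(n_actions) if rng.random() < eps else int(Q[s].argmax())
    r = R[s, a]
    s_next = int(rng.integers(n_states))     # random transitions for simplicity
    Q[s, a] += alpha * (r + gamma * Q[s_next].max() - Q[s, a])
    s = s_next

print(Q.argmax(axis=1))  # learned greedy action per state
```

After enough iterations the greedy policy picks the rewarded action in both states; the same update rule underlies the deep Q-network variants discussed in the citing papers below.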

Cited by 6 publications (5 citation statements)
References 13 publications
“…DRA (Briola et al. 2021) uses LSTM and PPO. CDQNRP (Zhu and Zhu 2022) uses a random perturbation to increase the stability of training a DQN. However, these algorithms focus mainly on designing only one RL agent to conduct profitable trading in short-term scenarios, neglecting its failure to maintain performance over long periods.…”
Section: RL for Quantitative Trading (mentioning)
confidence: 99%
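The randomly perturbed target-update frequency that the citation statements attribute to CDQNRP could be scheduled roughly as follows. This is a sketch reconstructed from the description alone; the nominal period and jitter range are illustrative assumptions, not values from the paper:

```python
import random

# Sketch of a randomly perturbed target-network sync schedule for a DQN:
# instead of copying online weights to the target every fixed N steps,
# the interval is jittered around a nominal period.
BASE_PERIOD = 100      # assumed nominal steps between target-network syncs
JITTER = 20            # assumed perturbation range

def next_sync_step(current_step, rng=random):
    """Step at which the target network is next copied from the online
    network, with the period randomly perturbed."""
    return current_step + BASE_PERIOD + rng.randint(-JITTER, JITTER)

# In a real training loop: if step >= sync_at, do target <- online weights.
step, sync_at = 0, next_sync_step(0)
sync_points = []
while step < 1000:
    step += 1
    if step >= sync_at:
        sync_points.append(step)      # stand-in for the weight copy
        sync_at = next_sync_step(step)
print(sync_points)
```

Every gap between syncs lands in [BASE_PERIOD - JITTER, BASE_PERIOD + JITTER], so the target network never goes stale for long while the exact copy times stay unpredictable.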
“…PPO and DQN. 4) DRA and CDQNRP: • CDQNRP (Zhu and Zhu 2022) uses a randomly perturbed target update frequency to enhance stability during training. • MACD (Krug, Dobaj, and Macher 2022) is an upgraded method based on the traditional moving average method.…”
(mentioning)
confidence: 99%
“…In Equation (42), $\pi(s_t(n) \mid \boldsymbol{\theta}_a)$ is the output of DDPG's actor. DDPG uses a replay buffer $\mathcal{B}$ that includes samples from older policies.…”
Section: Replay Buffer (mentioning)
confidence: 99%
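The replay buffer $\mathcal{B}$ mentioned in this statement can be sketched as a fixed-capacity store sampled uniformly, which is how transitions from older policies end up in DDPG's off-policy updates. The capacity and batch size below are illustrative assumptions:

```python
import random
from collections import deque

# Minimal DDPG-style replay buffer: old transitions remain until evicted
# by capacity, and batches are drawn uniformly over old and new samples.
class ReplayBuffer:
    def __init__(self, capacity=10_000):
        self.storage = deque(maxlen=capacity)   # oldest transitions evicted first

    def add(self, state, action, reward, next_state, done):
        self.storage.append((state, action, reward, next_state, done))

    def sample(self, batch_size, rng=random):
        # uniform sampling, regardless of which policy produced each sample
        return rng.sample(self.storage, batch_size)

buf = ReplayBuffer(capacity=100)
for t in range(250):                 # more transitions than capacity
    buf.add(t, 0, 0.0, t + 1, False)
batch = buf.sample(32)
print(len(buf.storage), len(batch))  # → 100 32
```

Because sampling ignores sample age, gradient updates mix experience from many past policies; that reuse is what makes DDPG off-policy and sample-efficient.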
“…Other domains where RL has been used include hospital decision making [37], precision agriculture [38], and fluid mechanics [39]. The financial industry is another important sector where RL has been adopted for several scenarios [40][41][42]. It is of little surprise that RL has been extensively used to solve various problems in energy systems [43][44][45][46][47].…”
Section: Introduction (mentioning)
confidence: 99%
“…Reinforcement learning (RL) is an important branch of machine learning. As a powerful approach in decision and control theory, RL has attracted extensive attention, with wide applications in robotics 1 , quantitative finance 2 , computer vision 3 , healthcare 4 , career planning 5 , gaming 6 , etc. The most common objective of a game is to beat the opponents, whether they are computers or other human players.…”
Section: Introduction (mentioning)
confidence: 99%