An End-to-End Optimal Trade Execution Framework based on Proximal Policy Optimization

Lin, Shih-Yang; Beling, Peter A.

doi:10.24963/ijcai.2020/627

Cited by 20 publications

(29 citation statements)

References 8 publications

(11 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The most popular types of RL methods that have been used in optimal execution problems are Q-learning algorithms and (double) DQN [102,159,241,108,51,181,158]. Policy-based algorithms are also popular in this field, including (deep) policy gradient methods [100,241], A2C [241], PPO [51,140], and DDPG [235]. The benchmark strategies studied in these papers include the Almgren-Chriss solution [102,100], the TWAP strategy [159,51,140], the VWAP strategy [140], and the SnL policy [158,235].…”

Section: Optimal Executionmentioning

confidence: 99%

“…Policy-based algorithms are also popular in this field, including (deep) policy gradient methods [100,241], A2C [241], PPO [51,140], and DDPG [235]. The benchmark strategies studied in these papers include the Almgren-Chriss solution [102,100], the TWAP strategy [159,51,140], the VWAP strategy [140], and the SnL policy [158,235]. In some models the trader is allowed to buy or sell the asset at each time point [108,241,217,56], whereas there are also many models where only one trading direction is allowed [158,102,100,159,51,181,235,140].…”

Section: Optimal Executionmentioning

confidence: 99%

“…The benchmark strategies studied in these papers include the Almgren-Chriss solution [102,100], the TWAP strategy [159,51,140], the VWAP strategy [140], and the SnL policy [158,235]. In some models the trader is allowed to buy or sell the asset at each time point [108,241,217,56], whereas there are also many models where only one trading direction is allowed [158,102,100,159,51,181,235,140]. The state variables are often composed of time stamp, the market attributes including (mid-)price of the asset and/or the spread, the inventory process and past returns.…”

Section: Optimal Executionmentioning

confidence: 99%

“…Experiments on real LOB data with 10 price levels show that the proposed approach significantly outperforms the existing methods, including the SnL policy (as baseline), the Q-learning algorithm, and the method in [102]. [140] proposed an adaptive framework based on PPO with neural networks including LSTM and fully-connected networks, and showed that the framework outperforms the baseline models including TWAP and VWAP, as well as several deep RL models on most of 14 US equities. [100] applied the (vanilla) policy gradient method to the LOB data of five stocks in different sectors and showed that they improve the Implementation Shortfall of the Almgren-Chriss solution by around 20%.…”

Section: Optimal Executionmentioning

confidence: 99%

See 3 more Smart Citations

Recent Advances in Reinforcement Learning in Finance

Hambly,

Xu,

Yang

2021

Preprint

View full text Add to dashboard Cite

The rapid changes in the finance industry due to the increasing amount of data have revolutionized the techniques on data processing and data analysis and brought new theoretical and computational challenges. In contrast to classical stochastic control theory and other analytical approaches for solving financial decision-making problems that heavily reply on model assumptions, new developments from reinforcement learning (RL) are able to make full use of the large amount of financial data with fewer model assumptions and to improve decisions in complex financial environments. This survey paper aims to review the recent developments and use of RL approaches in finance. We give an introduction to Markov decision processes, which is the setting for many of the commonly used RL approaches. Various algorithms are then introduced with a focus on value and policy based methods that do not require any model assumptions. Connections are made with neural networks to extend the framework to encompass deep RL algorithms. Our survey concludes by discussing the application of these RL algorithms in a variety of decision-making problems in finance, including optimal execution, portfolio optimization, option pricing and hedging, market making, smart order routing, and robo-advising.

show abstract

Section: Optimal Executionmentioning

confidence: 99%

Section: Optimal Executionmentioning

confidence: 99%

Section: Optimal Executionmentioning

confidence: 99%

Section: Optimal Executionmentioning

confidence: 99%

See 2 more Smart Citations

Recent Advances in Reinforcement Learning in Finance

Hambly,

Xu,

Yang

2021

Preprint

View full text Add to dashboard Cite

show abstract

“…PPO is another widely used RL method for OE. Lin and Beling [78] proposed an end-to-end PPO-based framework.…”

Section: Rl In Order Executionmentioning

confidence: 99%

Reinforcement Learning for Quantitative Trading

Sun,

Wang,

2021

Preprint

View full text Add to dashboard Cite

Quantitative trading (QT), which refers to the usage of mathematical models and data-driven techniques in analyzing the financial market, has been a popular topic in both academia and financial industry since 1970s. In the last decade, reinforcement learning (RL) has garnered significant interest in many domains such as robotics and video games, owing to its outstanding ability on solving complex sequential decision making problems. RL's impact is pervasive, recently demonstrating its ability to conquer many challenging QT tasks. It is a flourishing research direction to explore RL techniques' potential on QT tasks. This paper aims at providing a comprehensive survey of research efforts on RL-based methods for QT tasks. More concretely, we devise a taxonomy of RL-based QT models, along with a comprehensive summary of the state of the art. Finally, we discuss current challenges and propose future research directions in this exciting field.

show abstract

Investigating the Robustness and Generalizability of Deep Reinforcement Learning Based Optimal Trade Execution Systems

Lin

Beling

2021

Lecture Notes in Networks and Systems

View full text Add to dashboard Cite

An End-to-End Optimal Trade Execution Framework based on Proximal Policy Optimization

Cited by 20 publications

References 8 publications

Recent Advances in Reinforcement Learning in Finance

Recent Advances in Reinforcement Learning in Finance

Reinforcement Learning for Quantitative Trading

Investigating the Robustness and Generalizability of Deep Reinforcement Learning Based Optimal Trade Execution Systems

Contact Info

Product

Resources

About