Given recent requirements for ensuring the robustness of algorithmic trading strategies laid out in the Markets in Financial Instruments Directive II, this paper proposes a novel agentbased simulation for exploring algorithmic trading strategies. Five different types of agents are present in the market. The statistical properties of the simulated market are compared with equity market depth data from the Chi-X exchange and found to be significantly similar. The model is able to reproduce a number of stylised market properties including: clustered volatility, autocorrelation of returns, long memory in order flow, concave price impact and the presence of extreme price events. The results are found to be insensitive to reasonable parameter variations. Keywords Agent-based model • MIFiD II • Limit order book • Stylised facts • Algorithmic trading B Frank McGroarty
Abstract-For many players in financial markets, the price impact of their trading activity represents a large proportion of their transaction costs. This paper proposes a novel machine learning method for predicting the price impact of order book events. Specifically, we introduce a prediction system based on performance weighted ensembles of random forests. The system's performance is benchmarked using ensembles of other popular regression algorithms including: liner regression, neural networks and support vector regression using depth-of-book data from the BATS Chi-X exchange. The results show that recency-weighted ensembles of random forests produce over 15% greater prediction accuracy on out-of-sample data, for 5 out of 6 timeframes studied, compared with all benchmarks.
For any large player in financial markets, the impact of their trading activity represents a substantial proportion of transaction costs. This paper proposes a novel machine learning algorithm for predicting the price impact of order book events. Specifically, we introduce a prediction system based on ensembles of random forests (RFs). The system is trained and tested on depth-of-book data from the BATS and Chi-X exchanges and performance is benchmarked using ensembles of other popular regression algorithms including: linear regression, neural networks and support vector regression. The results show that recency-weighted ensembles of RFs produce over 15% greater prediction accuracy on out-of-sample data, for 5 out of 6 timeframes studied, compared with all benchmarks. Feature importance ranking is used to explore the significance of various market features on the price impact, finding them to be highly variable through time. Finally, a novel procedure for extracting the directional effects of features is proposed and used to explore the features most dominant in the price formation process.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.