2021
DOI: 10.1609/aaai.v35i1.16083
|View full text |Cite
|
Sign up to set email alerts
|

Universal Trading for Order Execution with Oracle Policy Distillation

Abstract: As a fundamental problem in algorithmic trading, order execution aims at fulfilling a specific trading order, either liquidation or acquirement, for a given instrument. Towards effective execution strategy, recent years have witnessed the shift from the analytical view with model-based market assumptions to model-free perspective, i.e., reinforcement learning, due to its nature of sequential decision optimization. However, the noisy and yet imperfect market information that can be leveraged by the policy has m… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4

Citation Types

0
13
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
3
1

Relationship

1
7

Authors

Journals

citations
Cited by 13 publications
(13 citation statements)
references
References 34 publications
0
13
0
Order By: Relevance
“…In quantitative finance, the primary goal of the investor is to maximize the long-term value through continuously trading of multiple assets in the market [1,2]. The process consists of two parts, portfolio management, which dynamically allocate the portfolio across the assets, and order execution whose goal is to fulfill a number of acquisition or liquidation orders specified by the portfolio management strategy, within a time horizon, and close the loop of investment [3,4]. Figure (1a) presents the trading process within one trading day.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…In quantitative finance, the primary goal of the investor is to maximize the long-term value through continuously trading of multiple assets in the market [1,2]. The process consists of two parts, portfolio management, which dynamically allocate the portfolio across the assets, and order execution whose goal is to fulfill a number of acquisition or liquidation orders specified by the portfolio management strategy, within a time horizon, and close the loop of investment [3,4]. Figure (1a) presents the trading process within one trading day.…”
Section: Introductionmentioning
confidence: 99%
“…Although there exists many works for order execution, few of them manage to address the above three challenges. Traditional financial model based methods [5][6][7] and some recently developed model-free reinforcement learning (RL) methods [4,8,9] only optimize the strategy for single-order execution without considering practice of multi-order execution, which would result in low trading efficacy. Moreover, it is not applicable to directly transfer the existing methods to multi-order execution since utilizing only one agent to conduct the execution of multiple orders would lead to scalability issue as the action space of one individual agent grows exponentially with the number of orders.…”
Section: Introductionmentioning
confidence: 99%
“…Reinforcement learning (RL) has achieved remarkable progress in games [31,47,50], financial trading [8] and robotics [13]. However, in its core part, without designs tailored to specific tasks, general RL paradigms are still learning implicit representations from critic loss (value predictions) and actor loss (maximizing cumulative reward).…”
Section: Introductionmentioning
confidence: 99%
“…Market traders buy and sell volatile assets frequently, with a goal to maximize their total return. Nowadays, it is not difficult for us to find trading strategies suitable for our preferences in many academic articles or forums [1]. However, it is still a problem of how to distinguish the good and bad of these strategies and avoid making some common mistakes, such as survivorship bias, look-ahead bias, and trading cost [2].…”
Section: Introductionmentioning
confidence: 99%
“…Let RP be the score given by RP, Seq be the score of Consecutive Days and His of Historical Data. The mapping relationship is: Conf = (His + Seq + RP − 1000)/50000 + 0.94(1)…”
mentioning
confidence: 99%