2020
DOI: 10.2139/ssrn.3726714
|View full text |Cite
|
Sign up to set email alerts
|

Dirichlet Policies for Reinforced Factor Portfolios

Abstract: This article aims to combine factor investing and reinforcement learning (RL). The agent learns through sequential random allocations which rely on firms' characteristics. Using Dirichlet distributions as the driving policy, we derive closed forms for the policy gradients and analytical properties of the performance measure. This enables the implementation of REINFORCE methods, which we perform on a large dataset of US equities. Across a large range of implementation choices, our result indicates that RL-based… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
1
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
2
1
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(2 citation statements)
references
References 81 publications
0
1
0
Order By: Relevance
“…The second one, which is known as the bird in the hand theory, is based on investors' preference for dividends for the quality of certainty (Lintner, 1962). The last of the three views holds that investors are not in favor of dividend payment because the tax factor that they assume for the dividends paid is greater than the capital gain (Litzenberger & Ramaswamy, 1982;Febrianti & Zulvia, 2020;Kovalev & Drachevsky, 2020;André & Coqueret, 2020;Hakim & Kusmanto, 2020;Nguyen et al, 2021).…”
Section: Hypothesis Developmentmentioning
confidence: 99%
“…The second one, which is known as the bird in the hand theory, is based on investors' preference for dividends for the quality of certainty (Lintner, 1962). The last of the three views holds that investors are not in favor of dividend payment because the tax factor that they assume for the dividends paid is greater than the capital gain (Litzenberger & Ramaswamy, 1982;Febrianti & Zulvia, 2020;Kovalev & Drachevsky, 2020;André & Coqueret, 2020;Hakim & Kusmanto, 2020;Nguyen et al, 2021).…”
Section: Hypothesis Developmentmentioning
confidence: 99%
“…Our work handles the no-shorting, no-borrowing constraint using deep neural network approximations with a special structure for the input data. A similar investment optimization problem under Dirichlet policies has been recently studied in [51]. It is interesting to note that their algorithm also generates equal-weight strategy in most cases, although the algorithm is trained using firm-level data rather than just market data.…”
Section: Related Workmentioning
confidence: 99%