2023
DOI: 10.48550/arxiv.2302.01680
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Two-Stage Constrained Actor-Critic for Short Video Recommendation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 0 publications
0
1
0
Order By: Relevance
“…A desired recommendation policy should be able to satisfy users for a long time [45]. Therefore, it is natural to involve Reinforcement Learning (RL) which is a type of Machine Learning concerned with how an intelligent agent can take actions to pursue a longterm goal [1,4,5,35]. In this setting, the recommendation process is formulated as a sequential decision process where the recommender interacts with users and receives users' online feedback (i.e., rewards) to optimize users' long-term engagement, rather than fitting a model on a set of samples based on supervised learning [7,53,62].…”
Section: Introductionmentioning
confidence: 99%
“…A desired recommendation policy should be able to satisfy users for a long time [45]. Therefore, it is natural to involve Reinforcement Learning (RL) which is a type of Machine Learning concerned with how an intelligent agent can take actions to pursue a longterm goal [1,4,5,35]. In this setting, the recommendation process is formulated as a sequential decision process where the recommender interacts with users and receives users' online feedback (i.e., rewards) to optimize users' long-term engagement, rather than fitting a model on a set of samples based on supervised learning [7,53,62].…”
Section: Introductionmentioning
confidence: 99%