2021
DOI: 10.48550/arxiv.2109.10665
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

A Survey on Reinforcement Learning for Recommender Systems

Abstract: Recommender systems have been widely applied in different real-life scenarios to help us find useful information. Recently, Reinforcement Learning (RL) based recommender systems have become an emerging research topic. It often surpasses traditional recommendation models even most deep learning-based methods, owing to its interactive nature and autonomous learning ability. Nevertheless, there are various challenges of RL when applying in recommender systems. Toward this end, we firstly provide a thorough overvi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
4
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(4 citation statements)
references
References 124 publications
0
4
0
Order By: Relevance
“…Finally, we define the negative sampling and rewards that are suitable for this MMIR scenario (Section 3.3). [1,9,22,32]. In this scenario, the users' interactions with the recommended items (actions) are returned as feedback (the so-called observations from the environments, such as views, clicks, skips, purchases, and ratings) to the recommendation agents, which usually convert the users' feedback into a reward signal [22].…”
Section: The Gommir Modelmentioning
confidence: 99%
“…Finally, we define the negative sampling and rewards that are suitable for this MMIR scenario (Section 3.3). [1,9,22,32]. In this scenario, the users' interactions with the recommended items (actions) are returned as feedback (the so-called observations from the environments, such as views, clicks, skips, purchases, and ratings) to the recommendation agents, which usually convert the users' feedback into a reward signal [22].…”
Section: The Gommir Modelmentioning
confidence: 99%
“…As the tool for optimizing the long-term/delayed metrics [24], reinforcement learning (RL) has been widely studied for optimizing user retention in recent years [6]. Though they are capable of exploring and modeling users' dynamic interests [39], existing RL-based SRSs leave much to be desired due to the offline learning challenge.…”
Section: Introductionmentioning
confidence: 99%
“…With the development of interactive recommender systems (RSs), reinforcement learning for recommendation (RL4Rec) is receiving increased attention as reinforcement learning (RL) methods can quickly adapt to user feedback [2,32]. RL4Rec has been applied in a variety of domains, such as movie [60,62], news [68], and music recommendations [41].…”
Section: Introductionmentioning
confidence: 99%