Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval 2020
DOI: 10.1145/3397271.3401147
|View full text |Cite
|
Sign up to set email alerts
|

Self-Supervised Reinforcement Learning for Recommender Systems

Abstract: In session-based or sequential recommendation, it is important to consider a number of factors like long-term user engagement, multiple types of user-item interactions such as clicks, purchases etc. The current state-of-the-art supervised approaches fail to model them appropriately. Casting sequential recommendation task as a reinforcement learning (RL) problem is a promising direction. A major component of RL approaches is to train the agent through interactions with the environment. However, it is often prob… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
118
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 152 publications
(128 citation statements)
references
References 34 publications
0
118
0
Order By: Relevance
“…As the research of self-supervised learning is still in its infancy, there are only several works combining it with recommender systems [24,44,45,64]. These efforts either mine self-supervision signals from future/surrounding sequential data [24,45], or mask attributes of items/users to learn correlations of the raw data [64]. However, these thoughts cannot be easily adopted to social recommendation where temporal factors and attributes may not be available.…”
Section: Self-supervised Learningmentioning
confidence: 99%
“…As the research of self-supervised learning is still in its infancy, there are only several works combining it with recommender systems [24,44,45,64]. These efforts either mine self-supervision signals from future/surrounding sequential data [24,45], or mask attributes of items/users to learn correlations of the raw data [64]. However, these thoughts cannot be easily adopted to social recommendation where temporal factors and attributes may not be available.…”
Section: Self-supervised Learningmentioning
confidence: 99%
“…States are defined in different ways in the existing literature. They can reflect a mapping of previous user-item interactions into a hidden state [15], user's recommendation and ad browsing history [13], previous items that a user clicked [12], the sequence of visited and recommended items [10] or a more detailed interaction sequence that contains clicking, purchasing, or skipping, leaving [14]. An interesting approach is to define states as the cluster resulted from the coclustering or biclustering of users and items [6] or to extend the state to include user demographics [5].…”
Section: F Recommender Systems Using Reinforcement Learningmentioning
confidence: 99%
“…Actions are mostly defined as selecting an item to be recommended from the whole discrete action space which contains the candidate items [12,14,15] or even whether to give a recommendation or not, and if yes, what would be the item to recommend [13]. There are authors that consider recommending a list of items [5,11,61].…”
Section: F Recommender Systems Using Reinforcement Learningmentioning
confidence: 99%
See 2 more Smart Citations