Self-Supervised Reinforcement Learning for Recommender Systems

Xin, Xin; Karatzoglou, Alexandros; Arapakis, Ioannis; Jose, Joemon M.

doi:10.1145/3397271.3401147

Cited by 152 publications

(128 citation statements)

References 34 publications

Supporting

Mentioning

118

Contrasting

Order By: Relevance

“…As the research of self-supervised learning is still in its infancy, there are only several works combining it with recommender systems [24,44,45,64]. These efforts either mine self-supervision signals from future/surrounding sequential data [24,45], or mask attributes of items/users to learn correlations of the raw data [64]. However, these thoughts cannot be easily adopted to social recommendation where temporal factors and attributes may not be available.…”

Section: Self-supervised Learningmentioning

confidence: 99%

Self-Supervised Multi-Channel Hypergraph Convolutional Network for Social Recommendation

Yin

et al. 2021

Proceedings of the Web Conference 2021

274

119

View full text Add to dashboard Cite

Social relations are often used to improve recommendation quality when user-item interaction data is sparse in recommender systems. Most existing social recommendation models exploit pairwise relations to mine potential user preferences. However, real-life interactions among users are very complicated and user relations can be high-order. Hypergraph provides a natural way to model complex high-order relations, while its potentials for improving social recommendation are under-explored. In this paper, we fill this gap and propose a multi-channel hypergraph convolutional network to enhance social recommendation by leveraging high-order user relations. Technically, each channel in the network encodes a hypergraph that depicts a common high-order user relation pattern via hypergraph convolution. By aggregating the embeddings learned through multiple channels, we obtain comprehensive user representations to generate recommendation results. However, the aggregation operation might also obscure the inherent characteristics of different types of high-order connectivity information. To compensate for the aggregating loss, we innovatively integrate self-supervised learning into the training of the hypergraph convolutional network to regain the connectivity information with hierarchical mutual information maximization. The experimental results on multiple real-world datasets show that the proposed model outperforms the SOTA methods, and the ablation study verifies the effectiveness of the multi-channel setting and the selfsupervised task. The implementation of our model is available via https://github.com/Coder-Yu/RecQ.

show abstract

Section: Self-supervised Learningmentioning

confidence: 99%

Self-Supervised Multi-Channel Hypergraph Convolutional Network for Social Recommendation

Yin

et al. 2021

Proceedings of the Web Conference 2021

274

119

View full text Add to dashboard Cite

show abstract

“…States are defined in different ways in the existing literature. They can reflect a mapping of previous user-item interactions into a hidden state [15], user's recommendation and ad browsing history [13], previous items that a user clicked [12], the sequence of visited and recommended items [10] or a more detailed interaction sequence that contains clicking, purchasing, or skipping, leaving [14]. An interesting approach is to define states as the cluster resulted from the coclustering or biclustering of users and items [6] or to extend the state to include user demographics [5].…”

Section: F Recommender Systems Using Reinforcement Learningmentioning

confidence: 99%

“…Actions are mostly defined as selecting an item to be recommended from the whole discrete action space which contains the candidate items [12,14,15] or even whether to give a recommendation or not, and if yes, what would be the item to recommend [13]. There are authors that consider recommending a list of items [5,11,61].…”

Section: F Recommender Systems Using Reinforcement Learningmentioning

confidence: 99%

“…There is a series of publications that explore the usage of RL in the area of RecSys. Out of which there are those that focus on user-item interaction sequence or user's browsing history and use it to create a state that later is fed to the RL model [10][11][12][13][14][15]. A different approach is to use user and item sets which are obtained from bi-clustering as environmental states [6].…”

Section: Introductionmentioning

confidence: 99%

“…An earlier paper is using both user information and item information vectors and refers to it as context [16]. Important work on integrating negative influence of irrelevant recommendations is done by using negative rewards [12,13,15,17].…”

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Fairness Embedded Adaptive Recommender System: A Conceptual Framework

Popa¹

2021

IJACSA

View full text Add to dashboard Cite

In the current fast paced and constantly changing environment, companies should ensure that their way of interacting with user is both relevant and highly adaptive. In order to stay competitive, companies should invest in state-ofthe-art technologies that optimize the relationship with the user using increasingly available data. The most popular applications used to develop user relationship are Recommender Systems. The vast majority of the traditional recommender system considers recommendation as a static procedure and focus on a specific type of recommendation, being not very agile in adapting to new situations. Also, when implementing a Recommender System there is the need to ensure fairness in the way decisions are made upon customer data. In this paper, it is proposed a novel Reinforcement Learning-based recommender system that is highly adaptive to changes in customer behavior and focuses on ensuring both producer and consumer fairness, Fairness Embedded Adaptive Recommender System (FEARS). The approach overcomes Reinforcement Learning's main drawback in recommendation area by using a small, but meaningful action space. Also, there are presented two fairness metrics, their calculation and adaptation for usage with Reinforcement Learning, this way ensuring that the system gets to the optimal trade-off between personalization and fairness.

show abstract

Improving image classification robustness using self‐supervision

Wittscher

Diers

Pigorsch

2022

Stat

View full text Add to dashboard Cite

Self‐supervised learning allows training of neural networks without immense, high‐quality or labelled data sets. We demonstrate that self‐supervision furthermore improves robustness of models using small, imbalanced or incomplete data sets which pose severe difficulties to supervised models. For small data sets, the accuracy of our approach is up to 12.5% higher using MNIST and 15.2% using Fashion‐MNIST compared to random initialization. Moreover, self‐supervision influences the way of learning itself, which means that in case of small or strongly imbalanced data sets, it can be prevented that classes are not or insufficiently learned. Even if input data are corrupted and large image regions are missing from the training set, self‐supervision significantly improves classification accuracy (up to 7.3% for MNIST and 2.2% for Fashion‐MNIST). In addition, we analyse combinations of data manipulations and seek to generate a better understanding of how pretext accuracy and downstream accuracy are related. This is not only important to ensure optimal pretraining but also for training with unlabelled data in order to find an appropriate evaluation measure. As such, we make an important contribution to learning with realistic data sets and making machine learning accessible to application areas that require expensive and difficult data collection.

show abstract

Self-Supervised Reinforcement Learning for Recommender Systems

Cited by 152 publications

References 34 publications

Self-Supervised Multi-Channel Hypergraph Convolutional Network for Social Recommendation

Self-Supervised Multi-Channel Hypergraph Convolutional Network for Social Recommendation

Fairness Embedded Adaptive Recommender System: A Conceptual Framework

Improving image classification robustness using self‐supervision

Contact Info

Product

Resources

About