Recommendations with Negative Feedback via Pairwise Deep Reinforcement Learning

Zhao, Xiangyu; Zhang, Liang; Ding, Zhuoye; Xia, Long; Tang, Jiliang; Yin, Dawei

doi:10.1145/3219819.3219886

Cited by 297 publications

(192 citation statements)

References 35 publications

Supporting

Mentioning

182

Contrasting

Order By: Relevance

“…In recent years, deep neural network models had a great impact on learning effective feature representations in various fields, such as speech recognition [12], Computer Vision (CV) [14] and Natural Language Processing (NLP) [4]. Some recent efforts have applied deep neural networks to recommendation tasks and shown promising results [41], but most of them used deep neural networks to model audio features of music [32], textual description of items [3,33], and visual content of images [40]. Besides, NeuMF [11] presented a Neural Collaborative Filtering framework to learn the non-linear interactions between users and items.…”

Section: Related Workmentioning

confidence: 99%

Graph Neural Networks for Social Recommendation

Fan

et al. 2019

The World Wide Web Conference

Self Cite

1,364

694

View full text Add to dashboard Cite

In recent years, Graph Neural Networks (GNNs), which can naturally integrate node information and topological structure, have been demonstrated to be powerful in learning on graph data. These advantages of GNNs provide great potential to advance social recommendation since data in social recommender systems can be represented as user-user social graph and user-item graph; and learning latent factors of users and items is the key. However, building social recommender systems based on GNNs faces challenges. For example, the user-item graph encodes both interactions and their associated opinions; social relations have heterogeneous strengths; users involve in two graphs (e.g., the useruser social graph and the user-item graph). To address the three aforementioned challenges simultaneously, in this paper, we present a novel graph neural network framework (GraphRec) for social recommendations. In particular, we provide a principled approach to jointly capture interactions and opinions in the user-item graph and propose the framework GraphRec, which coherently models two graphs and heterogeneous strengths. Extensive experiments on two real-world datasets demonstrate the effectiveness of the proposed framework GraphRec. Our code is available at https: //github.com/wenqifan03/GraphRec-WWW19

show abstract

Section: Related Workmentioning

confidence: 99%

Graph Neural Networks for Social Recommendation

Fan

et al. 2019

The World Wide Web Conference

Self Cite

1,364

694

View full text Add to dashboard Cite

show abstract

“…Equation (5); 30 end 31 # Updating the S-Network. 32 for j = 1 : K do 33 Sample mini-batches of (s t , i t , r t , s t +1 ) from M; 34 Set f , d , l , r according to r t , s t +1 ; 35 Update θ s via mini-batch SGD w.r.t. the loss in Equation (7); will cause the selection base.…”

Section: Simulator Learningmentioning

confidence: 99%

“…Unfortunately, current methods including Monte Carlo (MC) and temporaldifference (TD) have limitations for offline policy learning in realistic recommender systems: MC-based methods suffer from the problem of high variance, especially when facing enormous action space (e.g., billions of candidate items) in real-world applications; TD-based methods improve the efficiency by using bootstrapping techniques in estimation, which, however, is confronted with another notorious problem called Deadly Triad (i.e., the problem of instability and divergence arises whenever combining function approximation, bootstrapping, and offline training [24]). Unfortunately, state-of-the-art methods [33,34] in recommender systems, which are designed with neural architectures, will encounter inevitably the Deadly Triad problem in offline policy learning.…”

Section: Introductionmentioning

confidence: 99%

Reinforcement Learning to Optimize Long-term User Engagement in Recommender Systems

Zou

Xia

Ding

et al. 2019

Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery &Amp; Data Mining

Self Cite

187

120

View full text Add to dashboard Cite

Recommender systems play a crucial role in our daily lives. Feed streaming mechanism has been widely used in the recommender system, especially on the mobile Apps. The feed streaming setting provides users the interactive manner of recommendation in never-ending feeds. In such a manner, a good recommender system should pay more attention to user stickiness, which is far beyond classical instant metrics and typically measured by long-term user engagement. Directly optimizing long-term user engagement is a non-trivial problem, as the learning target is usually not available for conventional supervised learning methods. Though reinforcement learning (RL) naturally fits the problem of maximizing the long term rewards, applying RL to optimize long-term user engagement is still facing challenges: user behaviors are versatile to model, which typically consists of both instant feedback (e.g., clicks) and delayed feedback (e.g., dwell time, revisit); in addition, performing effective off-policy learning is still immature, especially when combining bootstrapping and function approximation.To address these issues, in this work, we introduce a RL framework -FeedRec to optimize the long-term user engagement. Fee-dRec includes two components: 1) a Q-Network which designed in hierarchical LSTM takes charge of modeling complex user behaviors, and 2) a S-Network, which simulates the environment, assists the Q-Network and voids the instability of convergence in policy learning. Extensive experiments on synthetic data and a real-world large scale data show that FeedRec effectively optimizes the longterm user engagement and outperforms state-of-the-arts. CCS CONCEPTS• Information systems → Recommender systems; Personalization; • Theory of computation → Sequential decision making. * Work performed during an internship at JD.com.

show abstract

“…This is possible because (i) edge storage and compute resources are more powerful with various system-on-chip (SoC) technologies and (ii) there is a dataprivacy practice to keep personal data locally. Further, due to its inherent capability of adaptive modeling and longterm planning, reinforcement learning presents potential in building interactive and personalized models, such as interactive recommendation systems [111], [112], [113].…”

Section: Model Training and Deploymentmentioning

confidence: 99%

The Disruptions of 5G on Data-Driven Technologies and Applications

Loghin

Cai

Chen

et al. 2020

IEEE Trans. Knowl. Data Eng.

View full text Add to dashboard Cite

With 5G on the verge of being adopted as the next mobile network, there is a need to analyze its impact on the landscape of computing and data management. In this paper, we analyze the impact of 5G on both traditional and emerging technologies and project our view on future research challenges and opportunities. With a predicted increase of 10-100x in bandwidth and 5-10x decrease in latency, 5G is expected to be the main enabler for smart cities, smart IoT and efficient healthcare, where machine learning is conducted at the edge. In this context, we investigate how 5G can help the development of federated learning. Network slicing, another key feature of 5G, allows running multiple isolated networks on the same physical infrastructure. However, security remains the main concern in the context of virtualization, multitenancy and high device density. Formal verification of 5G networks can be applied to detect security issues in massive virtualized environments. In summary, 5G will make the world even more densely and closely connected. What we have experienced in 4G connectivity will pale in comparison to the vast amounts of possibilities engendered by 5G.

show abstract

Recommendations with Negative Feedback via Pairwise Deep Reinforcement Learning

Cited by 297 publications

References 35 publications

Graph Neural Networks for Social Recommendation

Graph Neural Networks for Social Recommendation

Reinforcement Learning to Optimize Long-term User Engagement in Recommender Systems

The Disruptions of 5G on Data-Driven Technologies and Applications

Contact Info

Product

Resources

About