Fourteenth ACM Conference on Recommender Systems 2020
DOI: 10.1145/3383313.3412233
|View full text |Cite
|
Sign up to set email alerts
|

Learning to Collaborate in Multi-Module Recommendation via Multi-Agent Reinforcement Learning without Communication

Abstract: With the rise of online e-commerce platforms, more and more customers prefer to shop online. To sell more products, online platforms introduce various modules to recommend items with different properties such as huge discounts. A web page often consists of different independent modules. The ranking policies of these modules are decided by different teams and optimized individually without cooperation, which might result in competition between modules. Thus, the global policy of the whole page could be sub-opti… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
19
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 26 publications
(19 citation statements)
references
References 26 publications
0
19
0
Order By: Relevance
“…Generate the prototype item a p t and the recommended item a r t using Equations ( 6) and (7), respectively; Compute the losses L q , L r,c based on the minibatch, using Equations ( 17) and (20), respectively;…”
Section: Loss Function Of the Criticmentioning
confidence: 99%
See 3 more Smart Citations
“…Generate the prototype item a p t and the recommended item a r t using Equations ( 6) and (7), respectively; Compute the losses L q , L r,c based on the minibatch, using Equations ( 17) and (20), respectively;…”
Section: Loss Function Of the Criticmentioning
confidence: 99%
“…RL-based models aim to learn an optimal strategy to maximize the long terms rewards. RL-based models can be divided into three categories, the policy-based methods [4], [33], [31], [1], [12], [2], [30], the value-based methods [32], [34], [37], [11], and the actor-critic based methods [28], [6], [5], [7], [29], [24]. Chen et al [1] propose to use a balanced hierarchical clustering tree to tackle the large action space problem.…”
Section: Rl-based Recommendationmentioning
confidence: 99%
See 2 more Smart Citations
“…In recent years, to promoting the recommendation models to search meaningful paths rather than enumerate all possible paths in KGs, RL has been gradually introduced in recommendations. Some RL-based recommendation models [4,9,41] have achieved outstanding performance in recommendation. For example, Song et al [20] proposed a knowledge-aware recommendation model to generates meaningful paths from users to relevant items by learning a walking policy on the user-item-entity graph, which is designed to deal with the data sparsity and cold start problems.…”
Section: Recommendation With Rlmentioning
confidence: 99%