Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval 2021
DOI: 10.1145/3404835.3463118
|View full text |Cite
|
Sign up to set email alerts
|

Dual Unbiased Recommender Learning for Implicit Feedback

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
14
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
3
2

Relationship

1
7

Authors

Journals

citations
Cited by 13 publications
(14 citation statements)
references
References 13 publications
0
14
0
Order By: Relevance
“…Zhang et al [25] proposes to co-learn CTR, CVR and conducted debiasing within a multi-task learning framework. Recently, Lee et al [11] proposes a dual learning framework that simultaneously eliminates the confounding effect in clicked and unclicked data.…”
Section: Related Workmentioning
confidence: 99%
“…Zhang et al [25] proposes to co-learn CTR, CVR and conducted debiasing within a multi-task learning framework. Recently, Lee et al [11] proposes a dual learning framework that simultaneously eliminates the confounding effect in clicked and unclicked data.…”
Section: Related Workmentioning
confidence: 99%
“…Later, UBPR [29] is proposed to extend the pointwise model in Rel-MF to a pairwise version. UEBPR [12] and DU [21] introduce neighborhood-based explainability and unclicked data reweighting to the plain IPS methods, respectively. AutoDebias [6] combines IPS with data imputation and adopts a meta-learning algorithm to learn the optimal debiasing configurations on a small uniform data.…”
Section: Related Workmentioning
confidence: 99%
“…How to mitigate the bias and perform unbiased estimation of user preference has become one central theme in recommendation [7]. One prevalent solution is Inverse Propensity Scoring (IPS) [12,21,29,30], which reweights each data sample by the inverse of its propensity score (i.e., the exposure probability). Despite that IPS is unbiased in theory, it suffers from practical limitations: (1) it is challenging to accurately estimate the propensity score of each sample, since the exposure mechanism is seldom known [31]; and (2) the reweighted loss usually exhibits a high variance especially for implicit feedback [29,30,32], which implies that the losses of individual samples fluctuate heavily from their expected values.…”
Section: Introductionmentioning
confidence: 99%
“…The critical issue is how to estimate the propensity score πœ” 𝑒𝑖 from observed feedback (e.g., clicks). Previous studies [23,33,38,39,63] have developed several solutions to estimate πœ” 𝑒𝑖 . First, [23,38,39] introduced a heuristic function to estimate item popularity without using an additional model.…”
Section: Introductionmentioning
confidence: 99%
“…Previous studies [23,33,38,39,63] have developed several solutions to estimate πœ” 𝑒𝑖 . First, [23,38,39] introduced a heuristic function to estimate item popularity without using an additional model. Although intuitive, it focuses only on addressing item popularity and is thus, incapable of handling exposure bias caused by recommender models.…”
Section: Introductionmentioning
confidence: 99%