Enhanced Doubly Robust Learning for Debiasing Post-Click Conversion Rate Estimation

Guo, Siyuan; Zou, Lixin; Liu, Yiding; Ye, Wenwen; Cheng, Suqi; Wang, Shuaiqiang; Chen, Hechang; Yin, Dawei; Chang, Yi

doi:10.1145/3404835.3462917

Cited by 41 publications

(59 citation statements)

References 40 publications


“…MovieLens 100K (ML-100K) is a dataset of 100,000 MNAR ratings from 943 users and 1,682 movies collected from movie recommendation ratings. Following the standard procedure of previous studies [27,36,22,7], we performed the following preprocessing steps to carry out the semi-synthetic experiments.…”

Section: Methods (mentioning)

confidence: 99%

“…[36] proposes a doubly robust joint learning approach that improves the IPS method. A series of variants of DR methods are developed, such as more robust doubly robust (MRDR) method [7] and multi-task learning [43]. In addition, [2,13,5,37] design new debiasing algorithms via using a small uniform dataset.…”

Section: Related Work (mentioning)

confidence: 99%

“…Throughout, we adopt the RCT-free technique to estimate the propensity, which differs from the existing studies (e.g., Naive Bayes). In this section, following the previous studies [27,36,22,7], we aim to answer the following research question (RQ) on the semi-synthetic dataset: RQ1. Do the proposed TMLE estimators in estimating the ideal loss have both the statistical properties of relatively lower bias and variance in the presence of selection bias?…”

Section: Semi-synthetic Experiments (mentioning)

confidence: 99%

“…Causal inference methods are increasingly being employed in recommender system (RS) [41], such as post-view click-through rate prediction [7], post-click conversion rate prediction [7,43], and uplift modeling [23,25,26]. Recommendation based on causality has shown its great potential in both numeric experiments and theoretical analyses in various literature [4,36,41].…”

Section: Introduction (mentioning)

confidence: 99%

“…In RS, data missing not at random (MNAR) is a common problem, which can be interpreted from a causal perspective as "what would the feedback be, if recommending an item to a user", requiring to answer the counterfactual problem. To address this question, many methods have been proposed, such as inverse propensity score (IPS) [24,27], self-normalized inverse propensity score (SNIPS) [27,30], error imputation based (EIB) [8,28] learning, doubly robust (DR) [36,7,5,37] learning. Among them, the DR method and its variants show superior performance.…”

Section: Introduction (mentioning)

confidence: 99%
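
The estimator families named in the statement above (IPS, SNIPS, EIB, DR) can be illustrated with a minimal sketch. It uses the textbook forms of these estimators of the average prediction error over all user-item pairs, not code from any of the cited papers. The function names and the flat-list data layout are assumptions made for illustration: `errors` holds per-pair prediction errors (only entries with `observed == 1` are ever used), `propensities` holds estimated observation probabilities, and `imputed` holds imputed errors.

```python
from typing import Sequence


def naive(errors: Sequence[float], observed: Sequence[int]) -> float:
    """Average error over observed pairs only; biased when data are MNAR."""
    return sum(o * e for e, o in zip(errors, observed)) / sum(observed)


def ips(errors: Sequence[float], observed: Sequence[int],
        propensities: Sequence[float]) -> float:
    """Inverse propensity score: reweight each observed error by 1/p."""
    total = sum(o * e / p for e, o, p in zip(errors, observed, propensities))
    return total / len(errors)


def snips(errors: Sequence[float], observed: Sequence[int],
          propensities: Sequence[float]) -> float:
    """Self-normalized IPS: divide by the sum of weights instead of the pair count."""
    num = sum(o * e / p for e, o, p in zip(errors, observed, propensities))
    den = sum(o / p for o, p in zip(observed, propensities))
    return num / den


def eib(errors: Sequence[float], observed: Sequence[int],
        imputed: Sequence[float]) -> float:
    """Error imputation based: true error if observed, imputed error otherwise."""
    total = sum(o * e + (1 - o) * eh
                for e, o, eh in zip(errors, observed, imputed))
    return total / len(errors)


def dr(errors: Sequence[float], observed: Sequence[int],
       propensities: Sequence[float], imputed: Sequence[float]) -> float:
    """Doubly robust: imputed error plus an IPS-weighted correction on observed pairs."""
    total = sum(eh + o * (e - eh) / p
                for e, o, p, eh in zip(errors, observed, propensities, imputed))
    return total / len(errors)


# Toy data (hypothetical): four user-item pairs, two of them observed.
errors = [1.0, 2.0, 3.0, 4.0]
observed = [1, 0, 1, 0]
wrong_propensities = [0.9, 0.9, 0.1, 0.1]
perfect_imputation = [1.0, 2.0, 3.0, 4.0]

# With accurate imputed errors, DR recovers the full average (2.5)
# even though the propensities are wrong.
print(dr(errors, observed, wrong_propensities, perfect_imputation))
```

In this sketch, setting every imputed error to zero reduces `dr` to `ips`, which is one way to see why DR is described as building on IPS.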


Doubly Robust Collaborative Targeted Learning for Debiased Recommendations

Wu, Li, Liu et al. 2022

Preprint


In recommender systems, the feedback data received is always missing not at random (MNAR), which poses challenges for accurate rating prediction. To address this issue, many recent studies have been conducted on the doubly robust (DR) method and its variants to reduce bias. However, theoretical analysis shows that the DR method has a relatively large variance, while that of the error imputation-based (EIB) method is smaller. In this paper, we propose DR-TMLE that effectively captures the merits of both EIB and DR, by leveraging the targeted maximum likelihood estimation (TMLE) technique. DR-TMLE first obtains an initial EIB estimator and then updates the error imputation model along with the bias-reduced direction. Furthermore, we propose a novel RCT-free collaborative targeted learning algorithm for DR-TMLE, called DR-TMLE-TL, which updates the propensity model adaptively to reduce the bias of imputed errors. Both theoretical analysis and experiments demonstrate the advantages of the proposed methods compared with existing debiasing methods.

