“…In RS, data missing not at random (MNAR) is a common problem, which can be interpreted from a causal perspective as "what would the feedback be, if recommending an item to a user", requiring to answer the counterfactual problem. To address this question, many methods have been proposed, such as inverse propensity score (IPS) [24,27], self-normalized inverse propensity score (SNIPS) [27,30], error imputation based (EIB) [8,28] learning, doubly robust (DR) [36,7,5,37] learning. Among them, the DR method and its variants show superior performance.…”