“…As a result, GSU may probably miss relevant behaviors, but retrieve ones considered irrelevant by ESU, wasting ESU's precious computational resources. In such case, the TA in ESU, no matter how attention is allocated, 1 The Oracle is identically copied from the ESU of SIM-hard (trained from the top-100 behaviors retrieved by GSU), except that it ranks 10 4 rather than 10 2 behaviors. Experiments are conducted on a small demo dataset from Kuaishou, not feasible for normal datasets due to the extremely high cost of Oracle (details in Section 4.5).…”