“…For removing spurious correlations, a common principle underlying past work is to make a model's prediction invariant to the features that exhibit the correlation. This can be done by data augmentation (Kaushik et al, 2019), latent space removal (Ravfogel et al, 2020), subsampling (Sagawa et al, 2019(Sagawa et al, , 2020, or sample reweighing (Mahabadi et al, 2019;Orgad and Belinkov, 2022). In many cases, however, the correlated features may be important for the task and their complete removal can cause a degradation in task performance.…”