“…There have been a number of theorectical work on DRO and Optimal Transport, see [9,8,22,48,40,55,58]. In particular, [26,50,28,27,6] study the theory and applications of DRO problems using Wasserstein distance to parameterize the constraint set. [59] generalizes models to unseen domains by training the models with DRO.…”