“…In image generation, pixel-noise robust models [10,48] have begun to be studied in recent years. More recently, label-noise robust models [35,80] have also been proposed. The primary difference is that these are image generation models (i.e., they generate an image from random noise), whereas our RMIT is an image-to-image translation model.…”