“…Learning to synthesize novel views of an object or a scene given one or more sparse observation has been widely studied in the literature [5,6,7,8,19,9,20,10,13,21,11,14]. A unifying problem definition for this set of approaches is to predict a target view given a source view/s, conditioned on a relative camera transformation.…”