“…Inferring the 3D shape from a single image is an inherently ill-posed task -an image of a chair from the back does not remove ambiguities about the shape of its seat. Several approaches have shown impressive single-view reconstruction results using voxels [10,14,31,38,39], point clouds [13,19,41], meshes [36,37], and most recently implicit representations of 3D surfaces like SDFs [17,43], UDFs [9] and CSPs [35]. However, these are often deterministic in nature and only generate a 3D single output.…”