“…On the other hand, a number of generative view synthesis methods have been recently proposed utilizing neural volumetric representations [6, 15, 21, 56-58, 64, 71, 80]. These methods can learn to generate 3D representations from 2D supervision, and have demonstrated impressive results on generating novel objects [61], faces [6,13,21,60], or indoor environments [15,65]. However, none of these methods can generate unbounded outdoor scenes due to lack of multiview data for supervision, and due to the larger and more complex scene geometry and appearance that is difficult to model with prior representations.…”