“…In terms of input, earlier works on probabilistic models, e.g., [FRS∗12], generates a new scene by taking a random sample from a learned distribution, while recent works on deep generative neural networks, e.g., [LPX∗19], can produce a novel scene from a random noise vector. The input can also be a hand sketch [XCF∗13], a photograph [ISS17, LZW∗15], natural language commands [MGPF∗18], or human actions/activities [FLS∗15, MLZ∗16]. In terms of output, while most methods have been designed to generate room layouts with 3D furniture objects, some methods learn to produce floor or building plans [MSK10, WFT∗19].…”