“…A significant body of work learns scene affordances, such as where a person can stand or sit, from observing data of humans [17,18,25,27,32,33,42,54,77]. Overlapping areas of work focus on human interactions with objects [12,29,52,80,85] or synthesize human motion conditioned on an input scene [10,53,74]. We propose the reverse task of hallucinating a scene conditioned on pose.…”