Language-driven synthesis of 3D scenes from scene databases

Ma, Rui; Patil, Akshay Gadi; Fisher, Matthew; Li, Manyi; Pirk, Sören; Hua, Binh-Son; Yeung, Sai-Kit; Tong, Xin; Guibas, Leonidas J.; Zhang, Hao

doi:10.1145/3272127.3275035

Cited by 68 publications

(60 citation statements)

References 35 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…What graph representation maps most naturally to this language? There exists prior work in language-based scene creation [Chang et al 2015[Chang et al , 2014, including recent work that uses a graph-based intermediate representation [Ma et al 2018a]. However, it constructs scenes by retrieving parts of scenes from a database; new possibilities are opened up by a system that can synthesize truly new scenes from a partial graph.…”

Section: Resultsmentioning

confidence: 99%

“…Followup work has used undirected factor graphs learned from annotated RGB-D images [Kermani et al 2016], relation graphs between objects learned from human activity annotations [Fu et al 2017], and directed graphical models with Gaussian mixtures for modeling arrangement patterns [Paul Henderson 2018]. Other work has focused on conditioning the scene generation using input from RGB-D frames [Chen et al 2014], 2D sketches of the scene [Xu et al 2013], natural language text [Chang et al 2015;Ma et al 2018b], or activity predictions on RGB-D reconstructions [Fisher et al 2015].…”

Section: Background and Related Workmentioning

confidence: 99%

See 1 more Smart Citation

PlanIT

et al. 2019

View full text Add to dashboard Cite

show abstract

Section: Resultsmentioning

confidence: 99%

Section: Background and Related Workmentioning

confidence: 99%

PlanIT

et al. 2019

View full text Add to dashboard Cite

show abstract

“…Users can define their preference for small object arrangement interactively with our framework by manipulating the spatial relations between any two small objects. However, at the current stage, our framework cannot support advanced spatial relations such as "surrounded by" [37], which cannot be simply interpreted as multiple pairwise relations. It would be interesting to include these relations by proposing another effective data form, e.g.…”

Section: Discussion and Future Workmentioning

confidence: 96%

Active Arrangement of Small Objects in 3D Indoor Scenes

Zhang

Han

Lai

et al. 2021

IEEE Trans. Visual. Comput. Graphics

View full text Add to dashboard Cite

Small object arrangement is very important for creating detailed and realistic 3D indoor scenes. In this paper, we present an interactive framework based on active learning to help users create customized arrangements for small objects according to their preferences. To achieve this with minimal user effort, we first learn the prior knowledge about small object arrangement from a 3D indoor scene dataset through a probability mining method, which forms the initial guidance for arranging small objects. Then, users are able to express their preferences on a few small object categories, which are automatically propagated to all the other categories via a novel active learning approach. In the propagation process, we introduce a novel metric to obtain the propagation weights, which measures the degree of interchangeability between two small object categories, and is calculated based on a spatial embedding model learned from the small object neighborhood information extracted from the 3D indoor scene dataset. Experiments show that our framework is able to help users effectively create customized small object arrangements with little effort.

show abstract

“…In terms of input, earlier works on probabilistic models, e.g., [FRS∗12], generates a new scene by taking a random sample from a learned distribution, while recent works on deep generative neural networks, e.g., [LPX∗19], can produce a novel scene from a random noise vector. The input can also be a hand sketch [XCF∗13], a photograph [ISS17, LZW∗15], natural language commands [MGPF∗18], or human actions/activities [FLS∗15, MLZ∗16]. In terms of output, while most methods have been designed to generate room layouts with 3D furniture objects, some methods learn to produce floor or building plans [MSK10, WFT∗19].…”

Section: Application: Indoor Scene Synthesismentioning

confidence: 99%

Learning Generative Models of 3D Structures

Chaudhuri

Ritchie

et al. 2020

Computer Graphics Forum

Self Cite

View full text Add to dashboard Cite

3D models of objects and scenes are critical to many academic disciplines and industrial applications. Of particular interest is the emerging opportunity for 3D graphics to serve artificial intelligence: computer vision systems can benefit from synthetically‐generated training data rendered from virtual 3D scenes, and robots can be trained to navigate in and interact with real‐world environments by first acquiring skills in simulated ones. One of the most promising ways to achieve this is by learning and applying generative models of 3D content: computer programs that can synthesize new 3D shapes and scenes. To allow users to edit and manipulate the synthesized 3D content to achieve their goals, the generative model should also be structure‐aware: it should express 3D shapes and scenes using abstractions that allow manipulation of their high‐level structure. This state‐of‐the‐art report surveys historical work and recent progress on learning structure‐aware generative models of 3D shapes and scenes. We present fundamental representations of 3D shape and scene geometry and structures, describe prominent methodologies including probabilistic models, deep generative models, program synthesis, and neural networks for structured data, and cover many recent methods for structure‐aware synthesis of 3D shapes and indoor scenes.

show abstract

Language-driven synthesis of 3D scenes from scene databases

Cited by 68 publications

References 35 publications

PlanIT

PlanIT

Active Arrangement of Small Objects in 3D Indoor Scenes

Learning Generative Models of 3D Structures

Contact Info

Product

Resources

About