2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022

DOI: 10.1109/cvpr52688.2022.00357

|View full text |Cite

|

Sign up to set email alerts

|

Playable Environments: Video Manipulation in Space and Time

¹

,

Stéphane Lathuilière

²

,

Aliaksandr Siarohin

³

et al.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Introduction5

Citation Types

Supporting

0

Mentioning

34

Contrasting

0

Year Published

2023

2023

2024

2024

Publication Types

Select...

Other4

Article2

Relationship

Self Cite1

Independent5

Authors

Journals

Cited by 8 publications

(34 citation statements)

References 19 publications

Supporting

0

Mentioning

34

Contrasting

0

Order By: Relevance

“…Many learnable methods focus on reducing the need for manual labor and 3D assets in computer graphics [Holden et al 2017;Kuang et al 2022;Liu et al 2021;Starke et al 2019Starke et al , 2020, but only provide narrow video game functions. More related to our work, neural video game simulation methods show that annotated videos can be used to learn to generate videos interactively [Davtyan and Favaro 2022;Kim et al , 2020Menapace et al 2021] and build 3D environments where agents can be controlled through a set of discrete actions [Menapace et al 2022]. While bringing us closer to learnable game engines, when applied to complex or realworld environments, these works have several limitations: do not accurately model game logic, do not model physical interactions of objects in 3D space, do not learn fine-grained controls, do not allow for high-level goal-driven control of the game flow, and, finally, do not model game AI.…”

Section: Introductionmentioning

confidence: 99%

“…To overcome the limitations of [Davtyan and Favaro 2022;Kim et al , 2020Menapace et al 2021Menapace et al , 2022, not only we model the states of an environment, but we also consider detailed textual representations of the actions taking place in it. We argue that training on user commentaries describing detailed actions 1 Unreal and Unity engines are used to photorealistically render environments for film production.…”

Section: Introductionmentioning

confidence: 99%

“…1. Our synthesis model maintains a state for every object and agent included in the game and renders them in the image space using the compositional NeRF [Mildenhall et al 2020] of [Menapace et al 2022] followed by a learnable enhancer for superior rendering quality. To model the logic of games and game AI that determine the evolution of the environment states, we introduce an animation model.…”

Section: Introductionmentioning

confidence: 99%

“…In particular, we show that using text labels describing actions happening in a game is instrumental in learning such capabilities. While certain prior work [Kim et al , 2020Menapace et al 2021Menapace et al , 2022 explored maintaining and rendering states of games, we are not aware of any generative method that attempts enabling fine-grained control, modeling sophisticated goal-driven game logic, and learning game AI to the extent explored in this paper.…”

Section: Introductionmentioning

confidence: 99%

“…As far as we are aware, no existing work provides this set of capabilities under comparable data assumptions. • A synthesis model, based on a compositional NeRF, producing videos at the original frame rate and doubling the output resolution with respect to [Menapace et al 2022]. • An animation model, based on a text-conditioned diffusion model with a masked training procedure, which is key to support complex game logic, object interactions, game AI, and understanding fine-grained actions.…”

Section: Introductionmentioning

confidence: 99%

See 4 more Smart Citations

Plotting Behind the Scenes: Towards Learnable Game Engines

et al. 2023

Preprint

View full text Add to dashboard Cite

No abstract

“…Many learnable methods focus on reducing the need for manual labor and 3D assets in computer graphics [Holden et al 2017;Kuang et al 2022;Liu et al 2021;Starke et al 2019Starke et al , 2020, but only provide narrow video game functions. More related to our work, neural video game simulation methods show that annotated videos can be used to learn to generate videos interactively [Davtyan and Favaro 2022;Kim et al , 2020Menapace et al 2021] and build 3D environments where agents can be controlled through a set of discrete actions [Menapace et al 2022]. While bringing us closer to learnable game engines, when applied to complex or realworld environments, these works have several limitations: do not accurately model game logic, do not model physical interactions of objects in 3D space, do not learn fine-grained controls, do not allow for high-level goal-driven control of the game flow, and, finally, do not model game AI.…”

Section: Introductionmentioning

confidence: 99%

“…To overcome the limitations of [Davtyan and Favaro 2022;Kim et al , 2020Menapace et al 2021Menapace et al , 2022, not only we model the states of an environment, but we also consider detailed textual representations of the actions taking place in it. We argue that training on user commentaries describing detailed actions 1 Unreal and Unity engines are used to photorealistically render environments for film production.…”

Section: Introductionmentioning

confidence: 99%

“…1. Our synthesis model maintains a state for every object and agent included in the game and renders them in the image space using the compositional NeRF [Mildenhall et al 2020] of [Menapace et al 2022] followed by a learnable enhancer for superior rendering quality. To model the logic of games and game AI that determine the evolution of the environment states, we introduce an animation model.…”

Section: Introductionmentioning

confidence: 99%

“…In particular, we show that using text labels describing actions happening in a game is instrumental in learning such capabilities. While certain prior work [Kim et al , 2020Menapace et al 2021Menapace et al , 2022 explored maintaining and rendering states of games, we are not aware of any generative method that attempts enabling fine-grained control, modeling sophisticated goal-driven game logic, and learning game AI to the extent explored in this paper.…”

Section: Introductionmentioning

confidence: 99%

“…As far as we are aware, no existing work provides this set of capabilities under comparable data assumptions. • A synthesis model, based on a compositional NeRF, producing videos at the original frame rate and doubling the output resolution with respect to [Menapace et al 2022]. • An animation model, based on a text-conditioned diffusion model with a masked training procedure, which is key to support complex game logic, object interactions, game AI, and understanding fine-grained actions.…”

Section: Introductionmentioning

confidence: 99%

See 3 more Smart Citations

Plotting Behind the Scenes: Towards Learnable Game Engines

et al. 2023

Preprint

View full text Add to dashboard Cite

No abstract

SceNeRFlow: Time-Consistent Reconstruction of General Dynamic Scenes

Tretschk,

Golyanik,

Zollhöfer

et al. 2024

2024 International Conference on 3D Vision (3DV)

View full text Add to dashboard Cite

No abstract

Video2Game: Real-time, Interactive, Realistic and Browser-Compatible Environment from a Single Video

Xia,

Lin,

Ma

et al. 2024

2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

View full text Add to dashboard Cite

No abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Product

Browser Extension Assistant by scite Citation Statement Search Reference Check Visualizations Dashboards Explore Journals Explore Organizations Explore Funders Embedding Badge Embedding Citation Search Pricing

Resources

Blog Help & FAQ Accessibility Statement API Terms For Universities & Governments For Researchers For Publishers For Corporate, Pharma & Enterprise Author Marketing Become an Affiliate Get an organization trial or quote scite Data & Services

About

News & Press Careers Read our Paper Coverage

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Copyright © 2024 scite LLC. All rights reserved.

Made with 💙 for researchers

Part of the Research Solutions Family.