Self-supervised Outdoor Scene Relighting

Yu, Ye; Meka, Abhimitra; Elgharib, Mohamed; Seidel, Hans‐Peter; Theobalt, Christian; Smith, William A. P.

doi:10.1007/978-3-030-58542-6_6

Cited by 40 publications

(64 citation statements)

References 57 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Still, the illumination estimation is only considered for specific objects rather than natural scenes. Given multi-view images, Yu et al [42] proposed the first single imagebased outdoor scene relighting method along with lighting estimation for the scene. They used the spherical harmonics lighting [28] to generate the shading and it could not handle cast shadows caused by occlusion.…”

Section: Image Relightingmentioning

confidence: 99%

“…They used the spherical harmonics lighting [28] to generate the shading and it could not handle cast shadows caused by occlusion. Moreover, it is worth noting that although these relighting methods [19,20,31,34,38,42,46] with illumination estimation can be applied to image harmonization, additional computational overhead would be introduced, since illumination estimation for the background image is often accompanied by the estimation of other physical attributes in the background image. In other words, these relighting methods are not specifically designed for image harmonization.…”

Section: Image Relightingmentioning

confidence: 99%

See 1 more Smart Citation

NeurSF: Neural Shading Field for Image Harmonization

Hu¹,

Elie²,

Wang³

et al. 2021

Preprint

View full text Add to dashboard Cite

Image harmonization aims at adjusting the appearance of the foreground to make it more compatible with the background. Due to a lack of understanding of the background illumination direction, existing works are incapable of generating a realistic foreground shading. In this paper, we decompose the image harmonization into two sub-problems: 1) illumination estimation of background images and 2) rendering of foreground objects. Before solving these two sub-problems, we first learn a direction-aware illumination descriptor via a neural rendering framework, of which the key is a Shading Module that decomposes the shading field into multiple shading components given depth information. Then we design a Background Illumination Estimation Module to extract the direction-aware illumination descriptor from the background. Finally, the illumination descriptor is used in conjunction with the neural rendering framework to generate the harmonized foreground image containing a novel harmonized shading. Moreover, we construct a photo-realistic synthetic image harmonization dataset that contains numerous shading variations by image-based lighting. Extensive experiments on this dataset demonstrate the effectiveness of the proposed method. Our dataset and code will be made publicly available.

show abstract

Section: Image Relightingmentioning

confidence: 99%

Section: Image Relightingmentioning

confidence: 99%

NeurSF: Neural Shading Field for Image Harmonization

Hu¹,

Elie²,

Wang³

et al. 2021

Preprint

View full text Add to dashboard Cite

show abstract

“…However, the complexity in acquiring controlled multiple images of the same real-world object has led these models to be trained again only on synthetic data. Some recent works leverage photo collections of real scenes [22,48,47,27], but are often restricted to famous landmarks or street view imagery. Learning from unannotated image collections.…”

Section: Related Workmentioning

confidence: 99%

De-rendering the World's Revolutionary Artefacts

Makadia

Snavely

et al. 2021

Preprint

View full text Add to dashboard Cite

show abstract

“…Diverse photo collections of landmarks are unified by the underlying 3D scene geometry, despite the fact that a scene can look dramatically different from one image to the next due to varying illumination, alternating seasons, or special events. This geometric anchoring can be exploited when learning a range of geometry-related vision tasks, such as novel view synthesis [35,29], singleview depth prediction [28], and relighting [60,59], that require large amounts of diverse training data. However, prior work on tourist photos of landmarks has focused almost exclusively on lower-level reconstruction tasks, and not on higher-level scene understanding or recognition tasks.…”

Section: Introductionmentioning

confidence: 99%

Towers of Babel: Combining Images, Language, and 3D Geometry for Learning Multimodal Vision

Wu¹,

Averbuch‐Elor²,

Sun³

et al. 2021

Preprint

View full text Add to dashboard Cite

Figure 1: Our WikiScenes dataset combines 3D reconstructions, images, and language descriptions for dozens of landmarks, like the Barcelona and Reims Cathedrals pictured above. WikiScenes enables new tasks that combine different modalities, such as associating semantic concepts like "portal", "facade", and "tower" (colored in pink, blue and brown, respectively) with 3D structure across all cathedrals.

show abstract

Self-supervised Outdoor Scene Relighting

Cited by 40 publications

References 57 publications

NeurSF: Neural Shading Field for Image Harmonization

NeurSF: Neural Shading Field for Image Harmonization

De-rendering the World's Revolutionary Artefacts

Towers of Babel: Combining Images, Language, and 3D Geometry for Learning Multimodal Vision

Contact Info

Product

Resources

About