3D Ken Burns effect from a single image

Niklaus, Simon; Mai, Long; Yang, Jimei; Liu, Feng

doi:10.1145/3355089.3356528

Cited by 179 publications

(110 citation statements)

References 72 publications

(76 reference statements)

Supporting

Mentioning

103

Contrasting

Order By: Relevance

“…Normals-from-depth To evaluate normals-from-depth we use the synthetic dataset of Niklaus et al [20]. This comprises realistic scene renderings and includes depth and normal maps.…”

Section: Discussionmentioning

confidence: 99%

See 1 more Smart Citation

Least Squares Surface Reconstruction on Arbitrary Domains

Zhu

Smith

2020

Computer Vision – ECCV 2020

View full text Add to dashboard Cite

Almost universally in computer vision, when surface derivatives are required, they are computed using only first order accurate finite difference approximations. We propose a new method for computing numerical derivatives based on 2D Savitzky-Golay filters and K-nearest neighbour kernels. The resulting derivative matrices can be used for least squares surface reconstruction over arbitrary (even disconnected) domains in the presence of large noise and allowing for higher order polynomial local surface approximations. They are useful for a range of tasks including normal-from-depth (i.e. surface differentiation), height-fromnormals (i.e. surface integration) and shape-from-x. We show how to write both orthographic or perspective height-from-normals as a linear least squares problem using the same formulation and avoiding a nonlinear change of variables in the perspective case. We demonstrate improved performance relative to state-of-the-art across these tasks on both synthetic and real data and make available an open source implementation of our method.

show abstract

“…Normals-from-depth To evaluate normals-from-depth we use the synthetic dataset of Niklaus et al [20]. This comprises realistic scene renderings and includes depth and normal maps.…”

Section: Discussionmentioning

confidence: 99%

“…[14] 25.37 30.06 Table 1. Median angular error of estimated surface normals on two scenes from 3D Ken Burns dataset [20].…”

Section: Discussionmentioning

confidence: 99%

Least Squares Surface Reconstruction on Arbitrary Domains

Zhu

Smith

2020

Computer Vision – ECCV 2020

View full text Add to dashboard Cite

show abstract

“…GridNet is a grid-like architecture of rows and columns, where each row is a stream that processes features with resolution kept unchanged, and columns connect the streams by downsampling or upsampling the features. By allowing computation to happen at different layers and different spatial scales instead of conflating layers and spatial scales (as U-Nets do) GridNet produces more accurate predictions as has been successfully applied to a number of image synthesis tasks [Niklaus and Liu 2018;Niklaus et al 2019]. We use a GridNet with eight columns wherein the first three columns perform downsampling and the remaining five columns perform upsampling, and use five rows for foreign model and six rows for facial model, as we found this to work best after an architecture search.…”

Section: Neural Network Architecture and Trainingmentioning

confidence: 99%

Portrait shadow manipulation

et al. 2020

View full text Add to dashboard Cite

Fig. 1. The results of our portrait enhancement method on real-world portrait photographs. Casual portrait photographs often suffer from undesirable shadows, particularly foreign shadows cast by external objects, and dark facial shadows cast by the face upon itself under harsh illumination. We propose an automated technique for enhancing these poorly-lit portrait photographs by removing unwanted foreign shadows, reducing harsh facial shadows, and adding synthetic fill lights. Casually-taken portrait photographs often suffer from unflattering lighting and shadowing because of suboptimal conditions in the environment. Aesthetic qualities such as the position and softness of shadows and the lighting ratio between the bright and dark parts of the face are frequently determined by the constraints of the environment rather than by the photographer. Professionals address this issue by adding light shaping tools such as scrims, bounce cards, and flashes. In this paper, we present a computational approach that gives casual photographers some of this control, thereby

show abstract

“…Ramamonjisoa et al [24] aims to improve predicted depth boundaries by estimating normals and edges along with depth and establishing consensus between them. Several works apply bilateral filters to increase occlusion gaps [29] or learn energy-based imagedriven refinement focusing on edges [30], [16].…”

Section: Introductionmentioning

confidence: 99%

Object-aware Monocular Depth Prediction with Instance Convolutions

Simsar¹,

Örnek²,

Manhardt³

et al. 2021

Preprint

View full text Add to dashboard Cite

With the advent of deep learning, estimating depth from a single RGB image has recently received a lot of attention, being capable of empowering many different applications ranging from path planning for robotics to computational cinematography. Nevertheless, while the depth maps are in their entirety fairly reliable, the estimates around object discontinuities are still far from satisfactory. This can be contributed to the fact that the convolutional operator naturally aggregates features across object discontinuities, resulting in smooth transitions rather than clear boundaries. Therefore, in order to circumvent this issue, we propose a novel convolutional operator which is explicitly tailored to avoid feature aggregation of different object parts. In particular, our method is based on estimating per-part depth values by means of superpixels. The proposed convolutional operator, which we dub "Instance Convolution", then only considers each object part individually on the basis of the estimated superpixels. Our evaluation with respect to the NYUv2 as well as the iBims dataset clearly demonstrates the superiority of Instance Convolutions over the classical convolution at estimating depth around occlusion boundaries, while producing comparable results elsewhere. Code will be made publicly available upon acceptance.

show abstract

3D Ken Burns effect from a single image

Cited by 179 publications

References 72 publications

Least Squares Surface Reconstruction on Arbitrary Domains

Least Squares Surface Reconstruction on Arbitrary Domains

Portrait shadow manipulation

Object-aware Monocular Depth Prediction with Instance Convolutions

Contact Info

Product

Resources

About