2019
DOI: 10.1145/3355089.3356528
|View full text |Cite
|
Sign up to set email alerts
|

3D Ken Burns effect from a single image

Abstract: from to Fig. 1. 3D Ken Burns effect from a single image. Given a single input image and optional user annotations in form of two cropping windows, our framework animates the input image while adding parallax to synthesize a 3D Ken Burns effect. Our method works well for a wide variety of content, including portrait (top) and landscape (bottom) photos. Please refer to our supplementary video demo for more examples. Please note that this figure, as well as many other figures in this paper, contain video clips. S… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
103
0

Year Published

2020
2020
2021
2021

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 179 publications
(110 citation statements)
references
References 72 publications
(76 reference statements)
0
103
0
Order By: Relevance
“…Normals-from-depth To evaluate normals-from-depth we use the synthetic dataset of Niklaus et al [20]. This comprises realistic scene renderings and includes depth and normal maps.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Normals-from-depth To evaluate normals-from-depth we use the synthetic dataset of Niklaus et al [20]. This comprises realistic scene renderings and includes depth and normal maps.…”
Section: Discussionmentioning
confidence: 99%
“…[14] 25.37 30.06 Table 1. Median angular error of estimated surface normals on two scenes from 3D Ken Burns dataset [20].…”
Section: Discussionmentioning
confidence: 99%
“…GridNet is a grid-like architecture of rows and columns, where each row is a stream that processes features with resolution kept unchanged, and columns connect the streams by downsampling or upsampling the features. By allowing computation to happen at different layers and different spatial scales instead of conflating layers and spatial scales (as U-Nets do) GridNet produces more accurate predictions as has been successfully applied to a number of image synthesis tasks [Niklaus and Liu 2018;Niklaus et al 2019]. We use a GridNet with eight columns wherein the first three columns perform downsampling and the remaining five columns perform upsampling, and use five rows for foreign model and six rows for facial model, as we found this to work best after an architecture search.…”
Section: Neural Network Architecture and Trainingmentioning
confidence: 99%
“…Ramamonjisoa et al [24] aims to improve predicted depth boundaries by estimating normals and edges along with depth and establishing consensus between them. Several works apply bilateral filters to increase occlusion gaps [29] or learn energy-based imagedriven refinement focusing on edges [30], [16].…”
Section: Introductionmentioning
confidence: 99%