2020
DOI: 10.1007/978-3-030-58568-6_40
|View full text |Cite
|
Sign up to set email alerts
|

Self-supervised Single-View 3D Reconstruction via Semantic Consistency

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
126
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 89 publications
(126 citation statements)
references
References 30 publications
0
126
0
Order By: Relevance
“…Since this loss is discontinuous, they use the derivative-free Nelder-Mead optimisation method. In very recent, ambitious work, Li et al [18] learn both a deformable model and model fitting in a self-supervised fashion. One of their training objectives is to ensure semantic consistency, measured by projecting the semantically labelled 3D model into the image.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Since this loss is discontinuous, they use the derivative-free Nelder-Mead optimisation method. In very recent, ambitious work, Li et al [18] learn both a deformable model and model fitting in a self-supervised fashion. One of their training objectives is to ensure semantic consistency, measured by projecting the semantically labelled 3D model into the image.…”
Section: Related Workmentioning
confidence: 99%
“…Meanwhile, SoftRas compares a soft rasterisation to hard discrete input meaning that the minimum loss does not correspond to optimal alignment. No previous work, including [32,7,18], has considered the problem of estimating shape using only semantic segmentation information.…”
Section: Related Workmentioning
confidence: 99%
“…When ground-truth 3D models are not available, dynamic neural radiance fields (NeRF) [54,43,63,61] learn an implicit scene representation from videos, but are often scene-specific and don't scale to a class-agnostic setting. Self-supervised methods [91,35,41] are promising and learn 4D reconstruction via 2D supervision of differentiable rendering [37,48]. In contrast, in this work, we present a class-agnostic and template-free framework which learns to recover the shape and dynamics from input videos.…”
Section: Geometric Representationsmentioning
confidence: 99%
“…An automatic but robust autocalibration method for multiview human performance capture [4] is urgent. Examples include 3D human reconstruction from sparse views without pose parameters [5], selfcalibration [6] for systems when there are disturbances to camera setups [7], and enhancing the reconstruction performance due to the inaccurate camera calibration [8].…”
Section: Introductionmentioning
confidence: 99%