2017 IEEE Winter Conference on Applications of Computer Vision (WACV) 2017
DOI: 10.1109/wacv.2017.36
|View full text |Cite
|
Sign up to set email alerts
|

Global Model with Local Interpretation for Dynamic Shape Reconstruction

Abstract: The most standard approach to resolve the inherent ambiguities of the non-rigid structure from motion problem is using low-rank models that approximate deforming shapes by a linear combination of rigid basis. These models are typically global, i.e., each shape basis contributes equally to all points of the surface. While this approach has been shown effective to represent smooth deformations, its performance degrades for surfaces composed of various regions, each following a different deformation rule. Piecewi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
11
0

Year Published

2017
2017
2020
2020

Publication Types

Select...
3
2
1

Relationship

4
2

Authors

Journals

citations
Cited by 10 publications
(11 citation statements)
references
References 44 publications
(81 reference statements)
0
11
0
Order By: Relevance
“…A and traj. B, with 28, 000 points per frame) [19], expressions (384 frames with 997 points per frame) [4], and Kinect t-shirt (313 frames with 77, 000 points) and paper (193 frames with 58, 000 points) sequences taken from [64]. In the case if 3D ground truth shapes are available, ground truth dense point tracks are obtained by a virtual orthographic camera.…”
Section: Quantitative Comparisonsmentioning
confidence: 99%
“…A and traj. B, with 28, 000 points per frame) [19], expressions (384 frames with 997 points per frame) [4], and Kinect t-shirt (313 frames with 77, 000 points) and paper (193 frames with 58, 000 points) sequences taken from [64]. In the case if 3D ground truth shapes are available, ground truth dense point tracks are obtained by a virtual orthographic camera.…”
Section: Quantitative Comparisonsmentioning
confidence: 99%
“…For every dataset, we indicate by (V/F) the number of vertices and triangular faces, respectively. We consider the face synthetic sequences denoted as Seq3 [26] (28,887/57,552), and Ogre [27] (19,985/39,856); the mocap video denoted as Mocap [21] (2,494/4,339); and finally the real face videos denoted as Face [26] (28,332/56,516), Face1 [28] (196,446/391,642) and Face2 [28] (196,446/391,614). In addition, we also consider the blending shapes (5,792/10,221) provided by [28].…”
Section: Resultsmentioning
confidence: 99%
“…A priori, no information about the video is assumed and hence the observed face could potentially undergo non-rigid motions. In the last decade, this problem has been addressed by non-rigid structure from motion approaches [19], [20], [21], showing accurate results even with incomplete 2D point tracks. However, our approach only requires a 3D model or rest shape, rather than knowing a 3D shape per image frame, which can be easily computed by rigid structure from motion techniques [22].…”
Section: From Rgb Video To 3d Modelmentioning
confidence: 99%
“…The most standard approach to address the inherent ambiguity of the NRSfM problem is by enforcing the underlying 3D shape to lie in a low-rank manifold. In order to estimate such low-rank model, factorization-based approaches have been typically used [6], [15], [26], [31], [34], [48], [56]. Other approaches impose the low-rank constraints by means of robust PCA-like formulations which seek to minimize the rank of a matrix representing the shape.…”
Section: Related Workmentioning
confidence: 99%