Reconstruction of articulated objects from point correspondences in a single uncalibrated image

Ce, Taylor

doi:10.1109/cvpr.2000.855885

Cited by 115 publications

(50 citation statements)

References 10 publications


“…A common approach with the former representation is to "lift" 2D keypoints (either ground truth or from a 2D pose detector) to 3D. This has been recently done with neural networks [28,57,31] and previously using a dictionary of 3D skeletons [38,2,59,54] or other priors [47,50,2] to constrain the problem. The point cloud representation also allows one to train a CNN to regress directly from an image (instead of 2D keypoints) to 3D joints using supervision from motion capture datasets like Human 3.6M [35,41,34].…”

Section: Related Work (mentioning)

confidence: 99%

“…A key challenge with 3D pose estimation in-the-wild is the lack of ground truth for people performing arbitrary, unconstrained actions in-the-wild (as typically found on images scraped from the internet). However, a suitable proxy for 3D pose estimation quality is ordinal depth [47,34] i.e. given two keypoints, predict the relative depth ordering by specifying which keypoint is in front of the other.…”

Section: Ordinal Depth (mentioning)

confidence: 99%
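
The ordinal depth relation described in the statement above can be illustrated with a minimal sketch. It assumes depth values grow with distance from the camera; the function names `ordinal_relation` and `ordinal_agreement`, and the tolerance parameter `tol`, are hypothetical and do not come from either paper.

```python
from itertools import combinations


def ordinal_relation(z_a, z_b, tol=0.0):
    """Compare two keypoint depths (assumed: larger z = farther from camera).

    Returns -1 if a is in front of b, +1 if a is behind b,
    and 0 if the two depths differ by at most `tol`.
    """
    diff = z_a - z_b
    if abs(diff) <= tol:
        return 0
    return -1 if diff < 0 else 1


def ordinal_agreement(pred_depths, ref_depths, tol=0.0):
    """Fraction of keypoint pairs whose depth ordering matches the reference."""
    pairs = list(combinations(range(len(ref_depths)), 2))
    hits = sum(
        ordinal_relation(pred_depths[i], pred_depths[j], tol)
        == ordinal_relation(ref_depths[i], ref_depths[j], tol)
        for i, j in pairs
    )
    return hits / len(pairs)
```

Because only the ordering of each pair is compared, a prediction can score perfectly even when its absolute depths are wrong, which is what makes the measure usable as a proxy when metric ground truth is unavailable.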

“…given two keypoints, predict the relative depth ordering by specifying which keypoint is in front of the other. This utility of this task was demonstrated by Taylor [47], who showed that the 3D skeleton of a person could be reconstructed perfectly if exact 2D keypoint correspondences, bone lengths and ordinal relations between keypoints were known, assuming an orthographic camera.…”

Section: Ordinal Depth (mentioning)

confidence: 99%
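
The reconstruction described in the statement above follows from simple geometry: under a scaled-orthographic camera, a bone of known length whose endpoints project to known image points has a relative depth that is fixed up to sign, and the ordinal relation picks the sign. The sketch below illustrates this under stated assumptions: the scale factor `scale` is taken as known, depth grows away from the camera, and the names `relative_depth` and `reconstruct` are hypothetical. It is an illustration of the idea, not the cited paper's implementation.

```python
import math


def relative_depth(p_a, p_b, bone_length, scale, b_in_front):
    """Depth offset z_b - z_a for one bone under scaled orthography.

    With u = scale * X and v = scale * Y, the bone length l satisfies
    l^2 = (du^2 + dv^2) / scale^2 + dz^2, so dz is known up to sign.
    """
    du = p_a[0] - p_b[0]
    dv = p_a[1] - p_b[1]
    dz_sq = bone_length ** 2 - (du * du + dv * dv) / scale ** 2
    if dz_sq < 0:
        raise ValueError("projected bone is longer than scale * bone_length")
    dz = math.sqrt(dz_sq)
    # Assumed convention: smaller z is closer to the camera.
    return -dz if b_in_front else dz


def reconstruct(keypoints, bones, scale, root=0):
    """Recover 3D joints relative to the root joint.

    keypoints: dict joint_id -> (u, v) image coordinates
    bones: list of (parent, child, length, child_in_front), ordered so
           that every parent is placed before its children
    """
    z = {root: 0.0}
    for parent, child, length, child_in_front in bones:
        z[child] = z[parent] + relative_depth(
            keypoints[parent], keypoints[child], length, scale, child_in_front
        )
    return {j: (keypoints[j][0] / scale, keypoints[j][1] / scale, z[j]) for j in z}
```

For example, a bone of length 5 whose endpoints project to (0, 0) and (3, 0) at unit scale yields a depth offset of 4, placed behind or in front of the parent according to the ordinal flag. The result is only defined relative to the root, since absolute depth is unobservable under orthography.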


Exploiting Temporal Context for 3D Human Pose Estimation in the Wild

Arnab

Doersch

Zisserman

2019

2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)


We present a bundle-adjustment-based algorithm for recovering accurate 3D human pose and meshes from monocular videos. Unlike previous algorithms which operate on single frames, we show that reconstructing a person over an entire sequence gives extra constraints that can resolve ambiguities. This is because videos often give multiple views of a person, yet the overall body shape does not change and 3D positions vary slowly. Our method improves not only on standard mocap-based datasets like Human 3.6M, where we show quantitative improvements, but also on challenging in-the-wild datasets such as Kinetics. Building upon our algorithm, we present a new dataset of more than 3 million frames of YouTube videos from Kinetics with automatically generated 3D poses and meshes. We show that retraining a single-frame 3D pose estimator on this data improves accuracy on both real-world and mocap data by evaluating on the 3DPW and HumanEVA datasets.


“…Our method firstly utilizes [ZFL*10, CGZZ] to estimate mannequin 3D pose and shape (Figure 2(b)) from the input image. The 3D pose is recovered by a semi-automatic pose estimation method [Tay00] using the user-specified 2D joints. The recovered 3D orientations and rotations of skeletal bones can be interactively refined by users.…”

Section: Garment Initialization (mentioning)

confidence: 99%

“…where $L$ is one point in the oriented facet $F_{l_i}$, $L_{l_i}^{J}$ is the 3D joints for bone $l_i$ recovered by the semi-automatic pose estimation method [Tay00], and $n_{l_i}$ is the normal of the oriented facet $F_{l_i}$. $R_{l_i}$ is the 3-by-3 rotation matrix of bone $l_i$ calculated by using the absolute angles of the recovery pose.…”

Section: Garment Initialization (mentioning)

confidence: 99%

Garment Modeling from a Single Image

Zhou

Chen

et al.

2013

Computer Graphics Forum


Modeling of realistic garments is essential for online shopping and many other applications including virtual characters. Most existing methods require either a multi‐camera capture setup or a restricted mannequin pose. We address the garment modeling problem from a single input image. We design an all‐pose garment outline interpretation, and a shading‐based detail modeling algorithm. Our method first estimates the mannequin pose and body shape from the input image. It further interprets the garment outline with an oriented facet decided according to the mannequin pose to generate the initial 3D garment model. Shape details such as folds and wrinkles are modeled by shape‐from‐shading techniques, to improve the realism of the garment model. Our method achieves result quality similar to that of prior methods from just a single image, significantly improving the flexibility of garment modeling.
