2014
DOI: 10.1007/978-3-319-11752-2_20
|View full text |Cite
|
Sign up to set email alerts
|

Test-Time Adaptation for 3D Human Pose Estimation

Abstract: Abstract. In this paper we consider the task of articulated 3D human pose estimation in challenging scenes with dynamic background and multiple people. Initial progress on this task has been achieved building on discriminatively trained part-based models that deliver a set of 2D body pose candidates that are then subsequently refined by reasoning in 3D [1,4,5]. The performance of such methods is limited by the performance of the underlying 2D pose estimation approaches. In this paper we explore a way to boost … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
13
0

Year Published

2016
2016
2020
2020

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 11 publications
(13 citation statements)
references
References 18 publications
(50 reference statements)
0
13
0
Order By: Relevance
“…This problem corresponds to optimizing the energy function in Eq. (2). Note that using the optimization algorithm of [23] would require to construct a 3D state space that includes all 2D positions back-projected to 3D (amounting to the number of views multiplied by the size of the images) augmented with extra nodes for each back-projected node to account for occlusions.…”
Section: Multi-view Rgb-d Optimizationmentioning
confidence: 99%
See 2 more Smart Citations
“…This problem corresponds to optimizing the energy function in Eq. (2). Note that using the optimization algorithm of [23] would require to construct a 3D state space that includes all 2D positions back-projected to 3D (amounting to the number of views multiplied by the size of the images) augmented with extra nodes for each back-projected node to account for occlusions.…”
Section: Multi-view Rgb-d Optimizationmentioning
confidence: 99%
“…In a multi-view RGB system, correspondences across views are traditionally established by relying on appearance similarity and triangulation [6,14,2,7], which is unreliable in OR environments containing many surfaces that are visually similar. Instead, the depth data enables us to backproject points to 3D and is not affected by the visual appearance of the surfaces in the scene [23].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…But, these single-view datasets are lacking 3D annotations, contrary to multi-view datasets, which often come with accurate 3D body poses. As these are however much simpler for the task of 2D pose estimation [11,28], researchers have proposed methods to jointly use both single-and multi-view datasets in order to construct more robust 3D pose estimation models from multiple views [3,7]. Changes in camera setups however require the retraining of the model on training data from the same camera setup.…”
Section: Introductionmentioning
confidence: 99%
“…Related work. The fact that human appearance tends to stay unchanged through videos has been used in the past to aid pose estimation [1,36,41]. Ramanan et al [36] train discriminative body part detectors by first detecting 'easy' poses (such as a 'scissors' walking pose) and then using the appearance learnt from these poses to track the remaining video with a pictorial structure model.…”
Section: Introductionmentioning
confidence: 99%