2019 IEEE/CVF International Conference on Computer Vision (ICCV) 2019
DOI: 10.1109/iccv.2019.00081
|View full text |Cite
|
Sign up to set email alerts
|

Occlusion-Aware Networks for 3D Human Pose Estimation in Video

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
138
1

Year Published

2019
2019
2021
2021

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 198 publications
(146 citation statements)
references
References 39 publications
0
138
1
Order By: Relevance
“…3 and MPI-3DHP (Tab. 4) but, interestingly, semi-supervised approaches [17,16] are the most successful on the HumanEva dataset (Tab. 5).…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…3 and MPI-3DHP (Tab. 4) but, interestingly, semi-supervised approaches [17,16] are the most successful on the HumanEva dataset (Tab. 5).…”
Section: Discussionmentioning
confidence: 99%
“…Large high-quality 3D human pose estimation datasets are crucial for the success of deep learning models. The precise 3D annotations of human body joints serve as a direct supervision for models to learn how to detect the joints and resolve 2D-to-3D elevation ambiguities [30,59,17,42,25,16]. However, acquiring 3D data in the real world is challenging and is done in specially designed studios [31] and indoor environments, using wearable IMU sensors [70].…”
Section: Datasetsmentioning
confidence: 99%
See 1 more Smart Citation
“…As previously illustrated, multi-person human pose estimation (Cheng, Yang, Wang, Yan, & Tan, 2019;Y. He, Yan, Fragkiadaki, & Yu, 2020;Iskakov, Burkov, Lempitsky, & Malkov, 2019;Lassner et al, 2017;Pavlakos et al, 2019;Pavlakos, Zhou, Derpanis, & Daniilidis, 2017;Pavllo, Feichtenhofer, Grangier, & Auli, 2019) is a central part of vision-based analysis of football video.…”
Section: Appendix a Additional Work Related To Statistical Learning In Footballmentioning
confidence: 99%
“…It was trained in a weakly supervised manner without 2D to 3D correspondences and camera parameters. Cheng et al [134] proposed a method to handle occlusion by filtering out unreliable estimates of occluded keypoints when training their 2D and 3D temporal convolutional networks.…”
Section: Human Pose Estimationmentioning
confidence: 99%