2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition 2018
DOI: 10.1109/cvpr.2018.00869
|View full text |Cite
|
Sign up to set email alerts
|

Augmented Skeleton Space Transfer for Depth-Based Hand Pose Estimation

Abstract: Crucial to the success of training a depth-based 3D hand pose estimator (HPE) is the availability of comprehensive datasets covering diverse camera perspectives, shapes, and pose variations. However, collecting such annotated datasets is challenging. We propose to complete existing databases by generating new database entries. The key idea is to synthesize data in the skeleton space (instead of doing so in the depth-map space) which enables an easy and intuitive way of manipulating data entries. Since the skel… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
55
0
1

Year Published

2019
2019
2021
2021

Publication Types

Select...
5
3

Relationship

3
5

Authors

Journals

citations
Cited by 80 publications
(56 citation statements)
references
References 53 publications
(100 reference statements)
0
55
0
1
Order By: Relevance
“…Further, deep learning methods [65,30,50,58,30,64,28,38,64,2] have been successfully applied along with the million-scale hand pose dataset [65] in this domain. Recently, Malik et al [26] proposed to estimate both 3D skeletal and mesh representations and have obtained improved accuracy in 3D skeletal estimation.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Further, deep learning methods [65,30,50,58,30,64,28,38,64,2] have been successfully applied along with the million-scale hand pose dataset [65] in this domain. Recently, Malik et al [26] proposed to estimate both 3D skeletal and mesh representations and have obtained improved accuracy in 3D skeletal estimation.…”
Section: Related Workmentioning
confidence: 99%
“…1) is important as it helps understand e.g. human-object interactions [7,6,3,1] and perform robotic Discriminative methods based on convolutional neural networks (CNNs) have shown very promising performance in estimating 3D hand poses either from RGB images [43,68,4,14,29,46] or depth maps [65,30,50,58,30,64,28,38,64,2]. However, the predictions are based on coarse skeletal representations, and no explicit kinematics and geometric mesh constraints are often considered.…”
Section: Introductionmentioning
confidence: 99%
“…To generate continuous representations from categorical representations and their word representations, we first map all the attributes to their corresponding word representations using the pre-trained word2vec models. For our experiments we use Glove [23], publicly available representations of words 3 . However, word representations other than this can be easily used in our method.…”
Section: Proposed Methodsmentioning
confidence: 99%
“…To tackle this issue, data augmentation methods [16] propose to add invariance in input space by generating synthetic examples by applying several geometric transformations such as translation, rotation, flipping and random cropping. GAN synthetic data [12,3] is also being used to improve the generalisation capability of the networks. Similarly, Dropout [28] proposes to add invariance on activation layers by randomly dropping out neurons.…”
Section: Related Workmentioning
confidence: 99%
“…Another line of research for data augmentation is the use of large synthetic data generated by GANs (Baek, Kim, and Kim 2018;Shrivastava et al 2017;Zheng, Zheng, and Yang 2017;Gecer et al 2018). In these methods, synthetic data are used to augment real data but randomly when training CNNs.…”
mentioning
confidence: 99%