2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2017
DOI: 10.1109/cvpr.2017.602
|View full text |Cite
|
Sign up to set email alerts
|

3D Convolutional Neural Networks for Efficient and Robust Hand Pose Estimation from Single Depth Images

Abstract: We propose a simple, yet effective approach for real-time hand pose estimation from single depth images using threedimensional Convolutional Neural Networks (3D CNNs). Image based features extracted by 2D CNNs are not directly suitable for 3D hand pose estimation due to the lack of 3D spatial information. Our proposed 3D CNN taking a 3D volumetric representation of the hand depth image as input can capture the 3D spatial structure of the input and accurately regress full 3D hand pose in a single pass. In order… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
240
0
3

Year Published

2018
2018
2020
2020

Publication Types

Select...
5
4
1

Relationship

2
8

Authors

Journals

citations
Cited by 254 publications
(247 citation statements)
references
References 37 publications
0
240
0
3
Order By: Relevance
“…The paid efforts can be generally categorized into 3D CNN based and point-set based families. 3D CNN based methods [16,10,26] voxelizes the depth image into volumetric representation (e.g., occupancy grid models [24]). 3D convolution or deconvolution operation is then executed to capture 3D visual characteristics.…”
Section: Related Workmentioning
confidence: 99%
“…The paid efforts can be generally categorized into 3D CNN based and point-set based families. 3D CNN based methods [16,10,26] voxelizes the depth image into volumetric representation (e.g., occupancy grid models [24]). 3D convolution or deconvolution operation is then executed to capture 3D visual characteristics.…”
Section: Related Workmentioning
confidence: 99%
“…hand gesture as a primary interface for AR/VR. The problem is challenging due to high dimensionality of hand space, pose and shape variations, self-occlusions, etc [52,43,58,65,10,61,40,5,12,63,47,59,37,67,54,53,9,44,31,49,32,20,27,33,7,64]. Most existing methods have focused on recovering sparse hand poses i.e.…”
Section: Introductionmentioning
confidence: 99%
“…DISCO Nets [2] (NIPS 2016) 20.7 Crossing Nets [39] (CVPR 2017) 15.5 LSPS [1] (BMVC 2018) 15.4 Weak supervision [22] (CVIU 2017) 14.8 Lie-X [45] (IJCV 2017) 14.5 3DCNN [7] (CVPR 2017) 14.1 REN-9x6x6 [41] (JVCI 2018) 12.7 DeepPrior++ [23] (ICCVw 2017) 12.3 Pose Guided REN [3] (Neurocomputing 2018) 11.8 SHPR-Net [4] (IEEE Access 2018) 10.8 Hand PointNet [6] (CVPR 2018) 10.5 Dense 3D regression [40] (CVPR 2018) 10.2 V2V single model [20] (CVPR 2018) 9.2 V2V ensemble [20] (CVPR 2018) 8.4 Feature mapping [29] The comparisons in this section are based upon the numbers published by the authors. That is, these comparisons disregard differences in the used data subsamples, models, architectures, and other specificities.…”
Section: Me (Mm)mentioning
confidence: 99%