2018
DOI: 10.1109/access.2018.2863540
|View full text |Cite
|
Sign up to set email alerts
|

SHPR-Net: Deep Semantic Hand Pose Regression From Point Clouds

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
64
0

Year Published

2019
2019
2020
2020

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 82 publications
(64 citation statements)
references
References 37 publications
0
64
0
Order By: Relevance
“…In the current work, we learn encoders for RGB images and point clouds and decoders for 3D hand poses, point clouds and heat maps of the 2D hand key points on the RGB image. We choose to convert the 2.5D depth information as 3D point clouds instead of standard depth maps, due to its superior performance in hand pose estimation, as shown in previous works [10,4,6]. Heat maps are chosen as a third modality for decoding to encourage convergence of the RGB encoder, since the heat maps are closely related to activation areas on the RGB images.…”
Section: Encoder and Decoder Modulesmentioning
confidence: 99%
“…In the current work, we learn encoders for RGB images and point clouds and decoders for 3D hand poses, point clouds and heat maps of the 2D hand key points on the RGB image. We choose to convert the 2.5D depth information as 3D point clouds instead of standard depth maps, due to its superior performance in hand pose estimation, as shown in previous works [10,4,6]. Heat maps are chosen as a third modality for decoding to encourage convergence of the RGB encoder, since the heat maps are closely related to activation areas on the RGB images.…”
Section: Encoder and Decoder Modulesmentioning
confidence: 99%
“…Their method achieved satisfying performance, but tedious pre-processing steps are required, which includes oriented bounding box (OBB) calculation, surface normal estimation and k-nearest-neighbours search for all points. Chen et al improves Ge's method by using a spatial transformer network to replace the OBB and furthermore added a auxiliary hand segmentation task to improve the performance [3]. Their method can be trained end-to-end without OBB, but the segmentation ground-truth data require a extra precomputation step from the pose data.…”
Section: Deep Learning For Hand Pose Estimationmentioning
confidence: 99%
“…As a general trend, ever deeper and more sophisticated neural network architectures are dominating hand pose estimation methods. They are highly accurate [4,5,9,11,12,20,23,31,50] when trained with large amounts of labeled samples. However, given that accurate 3D annotations are extremely difficult to obtain, a number of works approach the problem with deep generative models to leverage unlabelled data [1,3,21,28,29,36,49].…”
Section: Related Workmentioning
confidence: 99%