2019
DOI: 10.48550/arxiv.1911.04231
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

PVN3D: A Deep Point-wise 3D Keypoints Voting Network for 6DoF Pose Estimation

Abstract: In this work, we present a novel data-driven method for robust 6DoF object pose estimation from a single RGBD image. Unlike previous methods that directly regressing pose parameters, we tackle this challenging task with a keypointbased approach. Specifically, we propose a deep Hough voting network to detect 3D keypoints of objects and then estimate the 6D pose parameters within a least-squares fitting manner. Our method is a natural extension of 2Dkeypoint approaches that successfully work on RGB based 6DoF es… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

1
10
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 11 publications
(11 citation statements)
references
References 59 publications
1
10
0
Order By: Relevance
“…The latter can also be verified inFigures 20,21,23,24 and 22 present our collected trajectories in various buildings for the train (20, 21 with red trajectories), test (23, 24 with yellow trajectories) and validation (22 with green trajectories) sets respectively. The gamification approach allowed us to collect realistic trajectories enabling our dataset to be used in different computer vision, and robotic tasks (i.e.…”
supporting
confidence: 61%
See 1 more Smart Citation
“…The latter can also be verified inFigures 20,21,23,24 and 22 present our collected trajectories in various buildings for the train (20, 21 with red trajectories), test (23, 24 with yellow trajectories) and validation (22 with green trajectories) sets respectively. The gamification approach allowed us to collect realistic trajectories enabling our dataset to be used in different computer vision, and robotic tasks (i.e.…”
supporting
confidence: 61%
“…PVNet [48] votes for keypoints, which are then used to estimate the object's 6DOF pose using PnP. Following a dense correspondence approach, various works directly regress either 2D image to 3D model correspondences [71], normalized object coordinates [65], or leverage deep Hough voting as in the case of PVN3D [23]. Regarding coordinates based regression, separate branches for the translation and rotational components were found to be more robust in predict pose [36], especially for occluded and textureless objects.…”
Section: Related Workmentioning
confidence: 99%
“…More recent approaches regress object 3D coordinates at each pixel [27,39] in a normalized space to then fit the pose of the objects using their 3D representations. Finally, PVN3D [15] relies on deep Hough voting to regress 3D keypoints directly and then fits the pose through least squares optimization. In this work, we bridge the task of semantic segmentation of an object and that of pose estimation by adding geometric constraints during training of the segmentation network.…”
Section: Related Workmentioning
confidence: 99%
“…The depth map captured by RGB-D camera preserves important geometry information of the given scene, allowing 2D algorithms to be extend into 3D space. Depth awareness has been proven to be crucial for many applications of scene understanding, e.g., scene parsing [62,30], 6D object pose estimation [59,28] and object detection [25,50], leading to a significant performance enhancement. Recently, there have been a few attempts to take into account the 3D geometric information for salient object detection in the given scene, e.g., by distilling prior knowledge from the depth [52] or incorporating depth information into a SOD framework [87,49,21].…”
Section: Introductionmentioning
confidence: 99%