2017 IEEE International Conference on Robotics and Automation (ICRA)
DOI: 10.1109/icra.2017.7989233
6-DoF object pose from semantic keypoints

Abstract: This paper presents a novel approach to estimating the continuous six degree of freedom (6-DoF) pose (3D translation and rotation) of an object from a single RGB image. The approach combines semantic keypoints predicted by a convolutional network (convnet) with a deformable shape model. Unlike prior work, we are agnostic to whether the object is textured or textureless, as the convnet learns the optimal representation from the available training image data. Furthermore, the approach can be applied to …
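As a rough illustration of the 2D-3D keypoint step described in the abstract, the sketch below fits a rigid 6-DoF pose to convnet-predicted keypoints with a standard PnP solver (OpenCV). It is only a minimal sketch under assumed inputs: the 2D keypoint locations, 3D model points, and camera intrinsics are placeholders, and the paper's actual fitting stage uses a deformable shape model rather than the fixed rigid model assumed here.

```python
import numpy as np
import cv2

# Placeholder 2D keypoint locations predicted by a convnet (pixels).
keypoints_2d = np.array([
    [320.0, 240.0],
    [350.0, 260.0],
    [300.0, 280.0],
    [330.0, 300.0],
    [310.0, 220.0],
    [360.0, 230.0],
], dtype=np.float64)

# Corresponding 3D keypoints on a rigid object model (meters), also placeholders.
keypoints_3d = np.array([
    [ 0.00,  0.00,  0.00],
    [ 0.10,  0.05,  0.00],
    [-0.05,  0.10,  0.02],
    [ 0.05,  0.15,  0.01],
    [-0.02, -0.05,  0.03],
    [ 0.12, -0.03,  0.02],
], dtype=np.float64)

# Pinhole camera intrinsics (assumed known) and no lens distortion.
K = np.array([[600.0, 0.0, 320.0],
              [0.0, 600.0, 240.0],
              [0.0, 0.0, 1.0]])
dist_coeffs = np.zeros(5)

# Solve for the rotation and translation that project the 3D keypoints
# onto their predicted 2D image locations.
ok, rvec, tvec = cv2.solvePnP(keypoints_3d, keypoints_2d, K, dist_coeffs,
                              flags=cv2.SOLVEPNP_ITERATIVE)
R, _ = cv2.Rodrigues(rvec)  # convert axis-angle to a 3x3 rotation matrix
print("R =\n", R)
print("t =", tvec.ravel())
```

In the paper itself, the fitting stage also optimizes over the coefficients of a deformable shape basis jointly with the pose, so the rigid PnP call above should be read only as the simplest instance of the keypoint-to-pose idea.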

Cited by 343 publications (269 citation statements)
References 39 publications (54 reference statements)
“…While the keypoint matching problem can be solved using machine learning, deep CNN-based feature learning methods typically fix the 2D-3D keypoint associations and learn to predict the image locations of each corresponding 3D keypoint such as [26,25,35]. They mainly differ in model architecture and the choice of keypoints.…”
Section: Monocular Pose Estimation (mentioning)
confidence: 99%
“…They mainly differ in model architecture and the choice of keypoints. For instance, [25] uses semantic keypoints while [35] chooses the vertices of the 3D bounding box of an object. In our spaceborne scenario, objects are typically not occluded and have relatively rich texture.…”
Section: Monocular Pose Estimation (mentioning)
confidence: 99%
“…The prediction is further refined with independently computed viewpoints. The human pose estimation by [16] has been modified by [19,36] to detect 3D keypoints of multiple rigid classes and consequently estimate the translation and rotation of the object by fitting the keypoints to a shape model.…”
Section: Keypoint Estimation (mentioning)
confidence: 99%
“…Due to space constraints, we concentrate our review on CNN-based methods, which can be grouped into two categories. Methods in the first category, such as [21] and [13], predict 2D keypoints from an image and then use 3D object models to predict the 3D pose given these keypoints. Methods in the second category, such as Viewpoints and Keypoints (V&K) [20] and Render-for-CNN [17], which are closer to what we do, predict 3D pose directly given an image.…”
Section: Introduction (mentioning)
confidence: 99%
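For contrast with the keypoint-based route, the second category mentioned in the last excerpt predicts pose directly from the image. The sketch below shows one minimal form such a direct-prediction head could take; the architecture, layer sizes, and quaternion parameterization are assumptions for illustration and do not reproduce the exact formulation of the methods cited above (which, for example, cast viewpoint estimation as classification).

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class DirectPoseHead(nn.Module):
    """Toy CNN mapping an RGB image directly to a 6-DoF pose
    (unit quaternion + translation). Sizes are illustrative only."""
    def __init__(self):
        super().__init__()
        self.backbone = nn.Sequential(
            nn.Conv2d(3, 16, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(16, 32, 3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1),          # global feature vector
        )
        self.fc_rot = nn.Linear(32, 4)        # quaternion output
        self.fc_trans = nn.Linear(32, 3)      # translation output

    def forward(self, img):
        feat = self.backbone(img).flatten(1)
        quat = F.normalize(self.fc_rot(feat), dim=1)  # enforce unit norm
        trans = self.fc_trans(feat)
        return quat, trans

# Usage on a dummy batch of images.
model = DirectPoseHead()
quat, trans = model(torch.randn(2, 3, 128, 128))
print(quat.shape, trans.shape)  # (2, 4), (2, 3)
```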