Segmentation-Driven 6D Object Pose Estimation

Hu, Yinlin; Hugonot, Joachim; Fua, Pascal; Salzmann, Mathieu

doi:10.1109/cvpr.2019.00350

Cited by 273 publications

(247 citation statements)

References 46 publications

Supporting

Mentioning

242

Contrasting

Unclassified

Order By: Relevance

“…A similar approach that is better suited for cluttered scenes divides images into patches where the corresponding object and 2D projections are predicted for each patch. Predictions are propagated across patches and used to build a robust set of 3D-to-2D correspondences [17]. Finally, feature representations are widely used.…”

Section: Related Workmentioning

confidence: 99%

“…While the feature-based methods typically output the wanted pose directly, methods that produce intermediate representations must employ a final step to produce usable poses. For going from key points to 6-DoF poses, this is typically achieved using a Perspective-n-Point (PnP) algorithm [16,17]. A recent comprehensive survey that covered the different aspects involved in robotic grasping can be found in [23].…”

Section: Related Workmentioning

confidence: 99%

See 1 more Smart Citation

Cutting Pose Prediction from Point Clouds

Philipsen

Moeslund

2020

Sensors

View full text Add to dashboard Cite

The challenge of getting machines to understand and interact with natural objects is encountered in important areas such as medicine, agriculture, and, in our case, slaughterhouse automation. Recent breakthroughs have enabled the application of Deep Neural Networks (DNN) directly to point clouds, an efficient and natural representation of 3D objects. The potential of these methods has mostly been demonstrated for classification and segmentation tasks involving rigid man-made objects. We present a method, based on the successful PointNet architecture, for learning to regress correct tool placement from human demonstrations, using virtual reality. Our method is applied to a challenging slaughterhouse cutting task, which requires an understanding of the local geometry including the shape, size, and orientation. We propose an intermediate five-Degree of Freedom (DoF) cutting plane representation, a point and a normal vector, which eases the demonstration and learning process. A live experiment is conducted in order to unveil issues and begin to understand the required accuracy. Eleven cuts are rated by an expert, with 8 / 11 being rated as acceptable. The error on the test set is subsequently reduced through the addition of more training data and improvements to the DNN. The result is a reduction in the average translation from 1.5 cm to 0.8 cm and the orientation error from 4.59° to 4.48°. The method’s generalization capacity is assessed on a similar task from the slaughterhouse and on the very different public LINEMOD dataset for object pose estimation across view points. In both cases, the method shows promising results. Code, datasets, and other materials are available in Supplementary Materials.

show abstract

Section: Related Workmentioning

confidence: 99%

Section: Related Workmentioning

confidence: 99%

Cutting Pose Prediction from Point Clouds

Philipsen

Moeslund

2020

Sensors

View full text Add to dashboard Cite

show abstract

“…Recent approaches, such as R-CNN [17], Fast R-CNN [18], Faster R-CNN [19] and YOLO [20], show amazing performance on detection task. As for Key-points localization problem, it has attracted considerable study in recent years [21] [12] and precise model of target to be detected [16] [36]. As for aircraft pose estimation situation, the depth information is hard to collect due to the long distance between the sensor and the target, also the model of the object is not available if it's non-cooperative object.…”

Section: A Object Detection and 2d Key-points Localizationmentioning

confidence: 99%

2D-Key-Points-Localization-Driven 3D Aircraft Pose Estimation

Rong

Zhu

2020

IEEE Access

View full text Add to dashboard Cite

In this paper, we are interesting in inferring 3D pose estimation of aircraft object leveraging 2D key-points localization. Monocular vision based pose estimation for aircraft can be widely utilized in airspace tasks like flight control system, air traffic management, autonomous navigation and air defense system. Nonetheless, prior methods using directly regression or classification can not meet the requirements of high precision in aircraft pose estimation context, other approaches using PnP algorithms that need additional information such as template 3D model or depth as prior knowledge. These methods do not exploit to full advantage the correlation information between 2D key-points and 3D pose. In this paper, we present a multi-branch network, named AirPose network, using convolutional neural network to address 3D pose estimation based on 2D key-points information. In the meantime, a novel feature fusion method is explored to enable orientation estimation branch adequately exploit key-points information. Our feature fusion method significantly decreases 3D pose estimation error also avoids the involvement of RANSAC based PnP algorithms. To address the problem that there is no available dedicated aircraft 3D pose dataset for training and testing, we build a visual simulation platform on Unreal Engine 4 applying domain randomization (DR) skill, named AKO platform, which generates aircraft images automatically labeled with 3D orientation and key-points location. The dataset is called AKO dataset. We implement a series of ablation experiments to evaluate our framework for aircraft object detection, key-points localization and orientation estimation on AKO dataset. Experiments show that our proposed AirPose network leveraging AKO dataset can achieve convincing results for each of the tasks.

show abstract

“…an object whose shape was not seen during training). Table 1 compares relevant works, and comprehensive reviews of object pose estimation can be found in [5,6,16,17]. Although DNN models estimate the 6 DoF object pose quite accurately, their training requires large amount of data usually annotated only for the high-level object category, containing images and/or known dense 3D models [5,6,7,16,17].…”

Section: Introductionmentioning

confidence: 99%

“…Table 1 compares relevant works, and comprehensive reviews of object pose estimation can be found in [5,6,16,17]. Although DNN models estimate the 6 DoF object pose quite accurately, their training requires large amount of data usually annotated only for the high-level object category, containing images and/or known dense 3D models [5,6,7,16,17]. For example, PoseCNN [18], DenseFusion [5], SegOPE [17] and PVNet [6] evaluate only on objects with high-quality 3D models and good visibility in depth [18].…”

Section: Introductionmentioning

confidence: 99%

Multi-View Shape Estimation of Transparent Containers

Xompero

Sánchez-Matilla

Modas

et al. 2020

ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

View full text Add to dashboard Cite

The 3D localisation of an object and the estimation of its properties, such as shape and dimensions, are challenging under varying degrees of transparency and lighting conditions. In this paper, we propose a method for jointly localising container-like objects and estimating their dimensions using two wide-baseline, calibrated RGB cameras. Under the assumption of vertical circular symmetry, we estimate the dimensions of an object by sampling at different heights a set of sparse circumferences with iterative shape fitting and image re-projection to verify the sampling hypotheses in each camera using semantic segmentation masks. We evaluate the proposed method on a novel dataset of objects with different degrees of transparency and captured under different backgrounds and illumination conditions. Our method, which is based on RGB images only outperforms, in terms of localisation success and dimension estimation accuracy a deep-learning based approach that uses depth maps.

show abstract

Segmentation-Driven 6D Object Pose Estimation

Cited by 273 publications

References 46 publications

Cutting Pose Prediction from Point Clouds

Cutting Pose Prediction from Point Clouds

2D-Key-Points-Localization-Driven 3D Aircraft Pose Estimation

Multi-View Shape Estimation of Transparent Containers

Contact Info

Product

Resources

About