Accurate 6D Object Pose Estimation by Pose Conditioned Mesh Reconstruction

Castro, Pablo; Armagan, Anil; Kim, Taekyun

doi:10.1109/icassp40776.2020.9053627

Cited by 11 publications (2 citation statements)

References 43 publications (144 reference statements)

“…While these methods fuse data coming from RGB and depth channels, a local belief propagation based approach [73] and an iterative refinement architecture [31], [32] are proposed in depth modality [74]. 6D object pose estimation is recently achieved from RGB only [164], [170], [172], [173], [174], [30], [37], [38], [40], and the current paradigm is to adopt CNNs [157], [158], [169]. BB8 [40] and Tekin et al. [38] perform corner-point regression followed by PnP.…”

Section: Introduction (mentioning)

confidence: 99%

A review on object pose recovery: From 3D bounding box detectors to full 6D pose estimators

Şahin, Garcia-Hernando, Sock, et al. 2020

Image and Vision Computing

Self Cite

Object pose recovery has gained increasing attention in the computer vision field as it has become an important problem in rapidly evolving technological areas related to autonomous driving, robotics, and augmented reality. Existing review-related studies have addressed the problem at visual level in 2D, going through the methods which produce 2D bounding boxes of objects of interest in RGB images. The 2D search space is enlarged either using the geometry information available in the 3D space along with RGB (Mono/Stereo) images, or utilizing depth data from LIDAR sensors and/or RGB-D cameras. 3D bounding box detectors, producing category-level amodal 3D bounding boxes, are evaluated on gravity aligned images, while full 6D object pose estimators are mostly tested at instance-level on the images where the alignment constraint is removed. Recently, 6D object pose estimation is tackled at the level of categories. In this paper, we present the first comprehensive and most recent review of the methods on object pose recovery, from 3D bounding box detectors to full 6D pose estimators. The methods mathematically model the problem as a classification, regression, classification & regression, template matching, and point-pair feature matching task. Based on this, a mathematical-model-based categorization of the methods is established. Datasets used for evaluating the methods are investigated with respect to the challenges, and evaluation metrics are studied. Quantitative results of experiments in the literature are analysed to show which category of methods best performs across what types of challenges. The analyses are further extended comparing two methods, which are our own implementations, so that the outcomes from the public results are further solidified. Current position of the field is summarized regarding object pose recovery, and possible research directions are identified.

“…Thanks to the rapid development of powerful graphical processing units (GPU), the data-driven techniques have made a great leap in pose estimation [8, 9]. Recent methods [10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22] can be categorized based on the types of input data, i.e., RGB or RGBD. Traditional data-driven approaches [23, 24] utilize convolution neural network (CNN) to select candidate feature points that roughly construct a bounding box surrounding the target object in 2D image, and subsequently solve the perspective-n-point (pnp) problem based on these points for pose estimation [25].…”

Section: Introduction (mentioning)

confidence: 99%

Iterative Pose Refinement for Object Pose Estimation Based on RGBD Data

Huang, Hsu, Wang, et al. 2020

Sensors

Accurate estimation of 3D object pose is highly desirable in a wide range of applications, such as robotics and augmented reality. Although significant advancement has been made for pose estimation, there is room for further improvement. Recent pose estimation systems utilize an iterative refinement process to revise the predicted pose to obtain a better final output. However, such refinement process only takes account of geometric features for pose revision during the iteration. Motivated by this approach, this paper designs a novel iterative refinement process that deals with both color and geometric features for object pose refinement. Experiments show that the proposed method is able to reach 94.74% and 93.2% in ADD(-S) metric with only 2 iterations, outperforming the state-of-the-art methods on the LINEMOD and YCB-Video datasets, respectively.
