“…Most existing pose estimation methods [2,3,5,12,16,18,24,37,44,49,52] are object-specific pose estimators, which are specialized for pre-defined objects and cannot generalize to previously unseen objects without retraining. Some of them [2,3,5,18,49,52] directly regress the 6D pose parameters from RGB images by training deep neural networks on a large number of labeled images. While other approaches [5,12,16,24,36,37,44] establish 2D-3D correspondences between 2D images and 3D object models to estimate the 6D pose by solving the Perspective-n-Point (PnP) [21] problem.…”