Recently, unsupervised domain adaptation in satellite pose estimation has gained increasing attention, aiming at alleviating the annotation cost for training deep models. To this end, we propose a self-training framework based on the domain-agnostic geometrical constraints. Specifically, we train a neural network to predict the 2D keypoints of a satellite and then use PnP to estimate the pose. The poses of target samples are regarded as latent variables to formulate the task as a minimization problem. Furthermore, we leverage fine-grained segmentation to tackle the information loss issue caused by abstracting the satellite as sparse keypoints. Finally, we iteratively solve the minimization problem in two steps: pseudo-label generation and network training. Experimental results show that our method adapts well to the target domain. Moreover, our method won the 1st place on the sunlamp task of the second international Satellite Pose Estimation Competition.
Pose estimation is important for many robotic applications including bin picking and robotic assembly and collaboration. However, robust and accurate estimation of the poses of industrial objects is a challenging task owing to the various object shapes and complex working environments. This paper presents a method of estimating the poses of narrow and elongated industrial objects with a low-cost RGB-D (depth and color) camera to guide the process of robotic assembly. The proposed method comprises three main steps: reconstruction involved in preprocessing, pose initialization with geometric features, and tracking aided by contour cues. Pose tracking is coupled with real-time dense reconstruction, which can synthesize a smooth depth image as a substitute for the raw depth image. Because industrial objects (e.g., fork and adapter) feature mostly planar structures, primitive geometric features, such as three-dimensional planes, are extracted from the point cloud and utilized to induce a promising initial pose. For robust tracking of the adapter consisting of narrow and elongated planes, the dense surface correspondences are combined with sparse contour correspondences in the refinement scheme. This combination allows for a satisfactory tolerance to the initial guess in the pose tracking phase. The experimental results demonstrate the feasibility of the proposed method.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.