Zhongchen Shi scite author profile

The random sample consensus (RANSAC) based algorithm is widely used in estimating the two-view geometry from image point correspondences. However, it often becomes extremely slow when the data is contaminated by a large percentage of incorrect matches. To address this problem, the paper proposes a new modification of RANSAC called LP-RANSAC that is robust to varying inlier ratios and achieves large computational savings without deterioration in accuracy. LP-RANSAC integrates the locality preserving constraint into the universal RANSAC framework, which prunes most of the unreliable correspondences before the hypothesize-and-verify loop and guides non-uniform sampling to generate and verify promising models earlier. Unlike other guided sampling strategies, the proposed method is simple to implement and does not require any prior information. Extensive experiments performed on the publicly available datasets reveal that LP-RANSAC can achieve more accurate and stable solutions at much lower computational cost (in milliseconds on standard CPU) than state-of-the-art methods, particularly when handling problems with low inlier ratios. INDEX TERMS Robust estimation, RANSAC, outlier removal, image matching, two-view geometry.

show abstract

DLT-Lines Based Camera Calibration with Lens Radial and Tangential Distortion

Shi

Shang

Zhang³

et al. 2021

Exp Mech

View full text Add to dashboard Cite

Precise monocular vision-based pose measurement system for lunar surface sampling manipulator

Gang

Shi

Shang

et al. 2019

Sci. China Technol. Sci.

View full text Add to dashboard Cite

TCU-Net: Transformer Embedded in Convolutional U-Shaped Network for Retinal Vessel Segmentation

Shi

Zou

et al. 2023

Sensors

View full text Add to dashboard Cite

Optical coherence tomography angiography (OCTA) provides a detailed visualization of the vascular system to aid in the detection and diagnosis of ophthalmic disease. However, accurately extracting microvascular details from OCTA images remains a challenging task due to the limitations of pure convolutional networks. We propose a novel end-to-end transformer-based network architecture called TCU-Net for OCTA retinal vessel segmentation tasks. To address the loss of vascular features of convolutional operations, an efficient cross-fusion transformer module is introduced to replace the original skip connection of U-Net. The transformer module interacts with the encoder’s multiscale vascular features to enrich vascular information and achieve linear computational complexity. Additionally, we design an efficient channel-wise cross attention module to fuse the multiscale features and fine-grained details from the decoding stages, resolving the semantic bias between them and enhancing effective vascular information. This model has been evaluated on the dedicated Retinal OCTA Segmentation (ROSE) dataset. The accuracy values of TCU-Net tested on the ROSE-1 dataset with SVC, DVC, and SVC+DVC are 0.9230, 0.9912, and 0.9042, respectively, and the corresponding AUC values are 0.9512, 0.9823, and 0.9170. For the ROSE-2 dataset, the accuracy and AUC are 0.9454 and 0.8623, respectively. The experiments demonstrate that TCU-Net outperforms state-of-the-art approaches regarding vessel segmentation performance and robustness.

show abstract

Reconstruction-based 6D pose estimation for robotic assembly

Shi

et al. 2020

Appl. Opt.

View full text Add to dashboard Cite

Pose estimation is important for many robotic applications including bin picking and robotic assembly and collaboration. However, robust and accurate estimation of the poses of industrial objects is a challenging task owing to the various object shapes and complex working environments. This paper presents a method of estimating the poses of narrow and elongated industrial objects with a low-cost RGB-D (depth and color) camera to guide the process of robotic assembly. The proposed method comprises three main steps: reconstruction involved in preprocessing, pose initialization with geometric features, and tracking aided by contour cues. Pose tracking is coupled with real-time dense reconstruction, which can synthesize a smooth depth image as a substitute for the raw depth image. Because industrial objects (e.g., fork and adapter) feature mostly planar structures, primitive geometric features, such as three-dimensional planes, are extracted from the point cloud and utilized to induce a promising initial pose. For robust tracking of the adapter consisting of narrow and elongated planes, the dense surface correspondences are combined with sparse contour correspondences in the refinement scheme. This combination allows for a satisfactory tolerance to the initial guess in the pose tracking phase. The experimental results demonstrate the feasibility of the proposed method.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Zhongchen Shi

Two-View Geometry Estimation Using RANSAC With Locality Preserving Constraint

DLT-Lines Based Camera Calibration with Lens Radial and Tangential Distortion

Precise monocular vision-based pose measurement system for lunar surface sampling manipulator

TCU-Net: Transformer Embedded in Convolutional U-Shaped Network for Retinal Vessel Segmentation

Reconstruction-based 6D pose estimation for robotic assembly

Contact Info

Product

Resources

About