Fangbo Qin scite author profile

This paper proposes a novel deep convolutional model, Tri-Points Based Line Segment Detector (TP-LSD), to detect line segments in an image at real-time speed. The previous related methods typically use the two-step strategy, relying on either heuristic post-process or extra classifier. To realize one-step detection with a faster and more compact model, we introduce the tri-points representation, converting the line segment detection to the end-to-end prediction of a root-point and two endpoints for each line segment. TP-LSD has two branches: tri-points extraction branch and line segmentation branch. The former predicts the heat map of root-points and the two displacement maps of endpoints. The latter segments the pixels on straight lines out from background. Moreover, the line segmentation map is reused in the first branch as structural prior. We propose an additional novel evaluation metric and evaluate our method on Wireframe and YorkUrban datasets, demonstrating not only the competitive accuracy compared to the most recent methods, but also the real-time run speed up to 78 FPS with the 320 × 320 input.

show abstract

Surgical instrument segmentation for endoscopic vision with data fusion of cnn prediction and kinematic pose

Qin

et al. 2019

View full text Add to dashboard Cite

TP-LSD: Tri-Points Based Line Segment Detector

Huang¹,

Qin

Xiong³

et al. 2020

View full text Add to dashboard Cite

Towards Better Surgical Instrument Segmentation in Endoscopic Vision: Multi-Angle Feature Aggregation and Contour Supervision

Qin

Lin

et al. 2020

IEEE Robot. Autom. Lett.

View full text Add to dashboard Cite

LC-GAN: Image-to-image Translation Based on Generative Adversarial Network for Endoscopic Images

Lin

Qin

et al. 2020

View full text Add to dashboard Cite

The intelligent perception of endoscopic vision is appealing in many computer-assisted and robotic surgeries. Achieving good vision-based analysis with deep learning techniques requires large labeled datasets, but manual data labeling is expensive and time-consuming in medical problems. When applying a trained model to a different but relevant dataset, a new labeled dataset may be required for training to avoid performance degradation. In this work, we investigate a novel cross-domain strategy to reduce the need for manual data labeling by proposing an image-to-image translation model called live-cadaver GAN (LC-GAN) based on generative adversarial networks (GANs). More specifically, we consider a situation when a labeled cadaveric surgery dataset is available while the task is instrument segmentation on a live surgery dataset. We train LC-GAN to learn the mappings between the cadaveric and live datasets. To achieve instrument segmentation on live images, we can first translate the live images to fake-cadaveric images with LC-GAN, and then perform segmentation on the fake-cadaveric images with models trained on the real cadaveric dataset. With this cross-domain strategy, we fully leverage the labeled cadaveric dataset for segmentation on live images without the need to label the live dataset again. Two generators with different architectures are designed for LC-GAN to make use of the deep feature representation learned from the cadaveric image based instrument segmentation task. Moreover, we propose structural similarity loss and segmentation consistency loss to improve the semantic consistency during translation. The results demonstrate that LC-GAN achieves better image-toimage translation results, and leads to improved segmentation performance in the proposed cross-domain segmentation task.

show abstract

Robotic Skill Learning for Precision Assembly With Microscopic Vision and Force Feedback

Qin

Zhang

et al. 2019

IEEE/ASME Trans. Mechatron.

View full text Add to dashboard Cite

Laser Beam Pointing Control With Piezoelectric Actuator Model Learning

Qin

Zhang

Dai

et al. 2020

IEEE Trans. Syst. Man Cybern, Syst.

View full text Add to dashboard Cite

Efficient Insertion Control for Precision Assembly Based on Demonstration Learning and Reinforcement Learning

Qin

2021

IEEE Trans. Ind. Inf.

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

334 Leonard St

Brooklyn, NY 11211

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Fangbo Qin

TP-LSD: Tri-Points Based Line Segment Detector

Surgical instrument segmentation for endoscopic vision with data fusion of cnn prediction and kinematic pose

TP-LSD: Tri-Points Based Line Segment Detector

Towards Better Surgical Instrument Segmentation in Endoscopic Vision: Multi-Angle Feature Aggregation and Contour Supervision

LC-GAN: Image-to-image Translation Based on Generative Adversarial Network for Endoscopic Images

Robotic Skill Learning for Precision Assembly With Microscopic Vision and Force Feedback

Laser Beam Pointing Control With Piezoelectric Actuator Model Learning

Efficient Insertion Control for Precision Assembly Based on Demonstration Learning and Reinforcement Learning

Contact Info

Product

Resources

About