Mayank Golhar scite author profile

While data-driven approaches excel at many image analysis tasks, the performance of these approaches is often limited by a shortage of annotated data available for training. Recent work in semi-supervised learning has shown that meaningful representations of images can be obtained from training with large quantities of unlabeled data, and that these representations can improve the performance of supervised tasks. Here, we demonstrate that an unsupervised jigsaw learning task, in combination with supervised training, results in up to a 9.8% improvement in correctly classifying lesions in colonoscopy images when compared to a fully-supervised baseline. We additionally benchmark improvements in domain adaptation and out-of-distribution detection, and demonstrate that semi-supervised learning outperforms supervised learning in both cases. In colonoscopy applications, these metrics are important given the skill required for endoscopic assessment of lesions, the wide variety of endoscopy systems in use, and the homogeneity that is typical of labeled datasets.

show abstract

Blood Vessel Delineation in Endoscopic Images with Deep Learning Based Scene Classification

Golhar

Iwahori

Bhuyan

et al. 2018

View full text Add to dashboard Cite

Colonoscopy 3D Video Dataset with Paired Depth from 2D-3D Registration

Bobrow¹,

Golhar²,

Vijayan³

et al. 2022

Preprint

View full text Add to dashboard Cite

Screening colonoscopy is an important clinical application for several 3D computer vision techniques, including depth estimation, surface reconstruction, and missing region detection. However, the development, evaluation, and comparison of these techniques in real colonoscopy videos remain largely qualitative due to the difficulty of acquiring ground truth data. In this work, we present a Colonoscopy 3D Video Dataset (C3VD) acquired with a high definition clinical colonoscope and high-fidelity colon models for benchmarking computer vision methods in colonoscopy. We introduce a novel multimodal 2D-3D registration technique to register optical video sequences with ground truth rendered views of a known 3D model. The different modalities are registered by transforming optical images to depth maps with a Generative Adversarial Network and aligning edge features with an evolutionary optimizer. This registration method achieves an average translation error of 0.321 millimeters and an average rotation error of 0.159 degrees in simulation experiments where error-free ground truth is available. The method also leverages video information, improving registration accuracy by 55.6% for translation and 60.4% for rotation compared to single frame registration. 22 short video sequences were registered to generate 10,015 total frames with paired ground truth depth, surface normals, optical flow, occlusion, six degree-of-freedom pose, coverage maps, and 3D models. The dataset also includes screening videos acquired by a gastroenterologist with paired ground truth pose and 3D surface models. The dataset and registration source code are available at durr.jhu.edu/C3VD.

show abstract

A Robust Method for Blood Vessel Extraction in Endoscopic Images with SVM-based Scene Classification

Golhar

Iwahori

Bhuyan

et al. 2017

View full text Add to dashboard Cite

GAN Inversion for Data Augmentation to Improve Colonoscopy Lesion Classification

Golhar¹,

Bobrow²,

Ngamruengphong³

et al. 2022

Preprint

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Mayank Golhar

Improving Colonoscopy Lesion Classification Using Semi-Supervised Deep Learning

Blood Vessel Delineation in Endoscopic Images with Deep Learning Based Scene Classification

Colonoscopy 3D Video Dataset with Paired Depth from 2D-3D Registration

A Robust Method for Blood Vessel Extraction in Endoscopic Images with SVM-based Scene Classification

GAN Inversion for Data Augmentation to Improve Colonoscopy Lesion Classification

Contact Info

Product

Resources

About