Proceedings of the 14th Hamlyn Symposium on Medical Robotics 2022 2022
DOI: 10.31256/hsmr2022.22
|View full text |Cite
|
Sign up to set email alerts
|

A Temporal Learning Approach to Inpainting Endoscopic Specularities and Its effect on Image Correspondence

Abstract: Computer vision has been utilized to analyze mini- mally invasive surgery videos and aid with polyp detec- tion, tool localization, and organ 3D modelling tasks. However, irregular light patterns such as saturation, specular highlights, or extreme contrasts occlude texture and hinder these tasks. In this work, specular highlights were removed and the occluded data was reconstructed. To do that, an unsupervised temporal generative ad- versarial network (GAN) was used to inpaint specular highlights spat… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(4 citation statements)
references
References 65 publications
0
4
0
Order By: Relevance
“…Compared with the STTN model, the training method based on pseudo dataset can generate images with more real details, which has a positive effect on image filling [13]. The PSNR value of this model is 29.542, which is higher than that of STTN model (28.683), but the advantage is not obvious.…”
Section: Obstacle Removalmentioning
confidence: 94%
See 1 more Smart Citation
“…Compared with the STTN model, the training method based on pseudo dataset can generate images with more real details, which has a positive effect on image filling [13]. The PSNR value of this model is 29.542, which is higher than that of STTN model (28.683), but the advantage is not obvious.…”
Section: Obstacle Removalmentioning
confidence: 94%
“…The latest method used at present is to eliminate the reflection of the endoscope in the video by removing the highlights in the video stream and then automatically filling the exact image according to the training results of the model [13]. The system generates "a pseudo ground truth dataset" by separating part of Hyper-Kvasir's video data, which is used for feature learning of the STTN system to make the image easily filled more real.…”
Section: Obstacle Removalmentioning
confidence: 99%
“…The result of these lighting aberrations is seen as temporal inconsistency and color flickering 23 , 24 . Recent efforts, such as those by Daher et al., 25 have tried to harness temporal data using traditional nonlearning methods to detect SR regions. However, their limited accuracy in SR detection causes temporal-aware generative adversarial models to unintentionally include undetected SR from other regions, resulting in noisy reconstructions.…”
Section: Related Workmentioning
confidence: 99%
“…A weakly supervised approach employed a two-stage network to detect and suppress specular highlights on laparoscopy endometriosis images using U-Net architecture [13]. In [14], they fine-tuned a Spatial-Temporal Transformer Network with pseudo-groundtruth of specularity masks, achieving an unsupervised approach to remove specular highlight on gastric videos.…”
Section: Specularitymentioning
confidence: 99%