2022
DOI: 10.1007/s11633-022-1371-y
|View full text |Cite
|
Sign up to set email alerts
|

Video Polyp Segmentation: A Deep Learning Perspective

Abstract: We present the first comprehensive video polyp segmentation (VPS) study in the deep learning era. Over the years, developments in VPS are not moving forward with ease due to the lack of a large-scale dataset with fine-grained segmentation annotations. To address this issue, we first introduce a high-quality frame-by-frame annotated VPS dataset, named SUN-SEG, which contains 158 690 colonoscopy video frames from the well-known SUN-database. We provide additional annotation covering diverse types, i.e., attribut… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
27
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 58 publications
(27 citation statements)
references
References 84 publications
0
27
0
Order By: Relevance
“…Hybrid 2/3D CNN framework [33] is used to aggregate spatio-temporal correlation and obtain better segmentation results. PNS+ [18] is the first study to comprehensively introduce the work related to video polyp segmentation in deep learning and the first to introduce a high-quality fine-grained annotated VPS dataset named SUN-SEG [30]. At the same time, a global encoder and a local encoder are designed in PNS+ to extract the long-term and short-term feature representation, respectively, and introduce a self-attention block to update the receptive field dynamically.…”
Section: Polyp Segmentationmentioning
confidence: 99%
See 3 more Smart Citations
“…Hybrid 2/3D CNN framework [33] is used to aggregate spatio-temporal correlation and obtain better segmentation results. PNS+ [18] is the first study to comprehensively introduce the work related to video polyp segmentation in deep learning and the first to introduce a high-quality fine-grained annotated VPS dataset named SUN-SEG [30]. At the same time, a global encoder and a local encoder are designed in PNS+ to extract the long-term and short-term feature representation, respectively, and introduce a self-attention block to update the receptive field dynamically.…”
Section: Polyp Segmentationmentioning
confidence: 99%
“…It contains 1,106 short video clips with a total of 158,690 frames, including 378 positive and 728 negative cases. We follow the same training/testing setting as in PNS+ [18] and only conduct experiments on positive cases. For training, we use 40% of the SUN-SEG dataset, including 112 clips with 19,544 frames.…”
Section: Datasetsmentioning
confidence: 99%
See 2 more Smart Citations
“…Therefore, camouflaged object detection (COD) presents a significantly more intricate challenge compared to traditional salient object detection (SOD) or other object segmentation. Recently, it has piqued ever-growing research interest from the computer vision community and facilitates many valuable real-life applications, such as search and rescue [1], species discovery [2], medical analysis (e.g., polyp segmentation [3], [4], [5], lung infection segmentation [6], and cell segmentation [7]), agricultural management [8], [9], and industrial defect detection [10].…”
Section: Introductionmentioning
confidence: 99%