Video Polyp Segmentation: A Deep Learning Perspective

Ji, Ge-Peng; Xiao, Guobao; Chou, Yu-Cheng; Fan, Deng-Ping; Zhao, Kai; Chen, Geng; Gool, Luc Van

doi:10.1007/s11633-022-1371-y

Cited by 58 publications

(27 citation statements)

References 84 publications


“…Hybrid 2/3D CNN framework [33] is used to aggregate spatio-temporal correlation and obtain better segmentation results. PNS+ [18] is the first study to comprehensively introduce the work related to video polyp segmentation in deep learning and the first to introduce a high-quality fine-grained annotated VPS dataset named SUN-SEG [30]. At the same time, a global encoder and a local encoder are designed in PNS+ to extract the long-term and short-term feature representation, respectively, and introduce a self-attention block to update the receptive field dynamically.…”

Section: Polyp Segmentation (mentioning)

confidence: 99%

“…It contains 1,106 short video clips with a total of 158,690 frames, including 378 positive and 728 negative cases. We follow the same training/testing setting as in PNS+ [18] and only conduct experiments on positive cases. For training, we use 40% of the SUN-SEG dataset, including 112 clips with 19,544 frames.…”

Section: Datasets (mentioning)

confidence: 99%

“…Instead of relying on images, methods have been proposed to use colonoscopy videos fully. These methods are categorized as video polyp segmentation (VPS), where convolutional neural networks (CNNs) have been widely employed [33,15,18,44,43,20,55]. For instance, Puyal et al [33] proposed a hybrid VPS framework, where a 2D network acts as the backbone for extracting spatial features and a 3D network ensures temporal consistency.…”

Section: Introduction (mentioning)

confidence: 99%

“…For instance, Puyal et al [33] proposed a hybrid VPS framework, where a 2D network acts as the backbone for extracting spatial features and a 3D network ensures temporal consistency. Ji et al [18] comprehensively introduced the work related to video polyp segmentation in deep learning and the proposed model, PNS+, is the first to introduce a high-quality fine-grained annotated VPS dataset named SUN-SEG [30].…”

Section: Introduction (mentioning)

confidence: 99%


Research on Application of Big Data Technology in Cost Control of Medical Insurance For China

Chen, 2021

dtcse


With the increasing aging of the population, medical insurance cost control has become a worldwide problem. In order to control the medical insurance fee, since 2009, China's medical insurance payment system has changed from the post payment system mainly based on projects to the prepayment system based on total amount control, but not only failed to control the growth of medical insurance expenditure, but also resulted in the phenomenon that the insured's right to medical treatment has been damaged. Using big data technology and other information-based means to fine manage medical insurance expenditure has become an urgent matter. The paper is divided into four parts. The first part introduces the characteristics of big data and the concept of big data technology; the second part, under the background of the reform of China's medical insurance payment system, analyzes the necessity and possibility of using big data technology to control medical insurance expenses; the third part, puts forward some measures of using big data technology to control medical insurance expenses; finally, the conclusion is drawn that the use of big data technology can realize the fine management of medical insurance control fee.


“…Therefore, camouflaged object detection (COD) presents a significantly more intricate challenge compared to traditional salient object detection (SOD) or other object segmentation. Recently, it has piqued ever-growing research interest from the computer vision community and facilitates many valuable real-life applications, such as search and rescue [1], species discovery [2], medical analysis (e.g., polyp segmentation [3], [4], [5], lung infection segmentation [6], and cell segmentation [7]), agricultural management [8], [9], and industrial defect detection [10].…”

Section: Introduction (mentioning)

confidence: 99%

Zoom In and Out: A Mixed-scale Triplet Network for Camouflaged Object Detection

Pang, Zhao, Xiang, et al. 2022

2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

133


Recent camouflaged object detection (COD) attempts to segment objects visually blended into their surroundings, which is extremely complex and difficult in real-world scenarios. Apart from the high intrinsic similarity between camouflaged objects and their background, objects are usually diverse in scale, fuzzy in appearance, and even severely occluded. To this end, we propose an effective unified collaborative pyramid network which mimics human behavior when observing vague images and videos, i.e., zooming in and out. Specifically, our approach employs the zooming strategy to learn discriminative mixed-scale semantics by the multi-head scale integration and rich granularity perception units, which are designed to fully explore imperceptible clues between candidate objects and background surroundings. The former's intrinsic multi-head aggregation provides more diverse visual patterns. The latter's routing mechanism can effectively propagate inter-frame difference in spatiotemporal scenarios and adaptively ignore static representations. They provide a solid foundation for realizing a unified architecture for static and dynamic COD. Moreover, considering the uncertainty and ambiguity derived from indistinguishable textures, we construct a simple yet effective regularization, uncertainty awareness loss, to encourage predictions with higher confidence in candidate regions. Our highly task-friendly framework consistently outperforms existing state-of-the-art methods in image and video COD benchmarks. The code will be available at https://github.com/lartpang/ZoomNeXt.


Polyp segmentation network based on lightweight model and reverse attention mechanisms

Long, Yang, Song, et al. 2024

Int J Imaging Syst Tech


Colorectal cancer is a common gastrointestinal malignancy. Early screening and segmentation of colorectal polyps are of great clinical significance. Colonoscopy is the most effective method to detect polyps, but some polyps may be missed in the detection process. On this basis, the use of computer‐aided diagnosis technology is particularly important for colorectal polyp segmentation. To improve the detection rate of intestinal polyps under colonoscopy, a polyp segmentation network (MobileRaNet) based on a lightweight model and reverse attention (RA) mechanism was proposed to accurately segment polyps in colonoscopy images. The coordinated attention module is used to improve MobileNetV3 and make it the backbone network (CaNet). Second, a part of the output of the high‐level feature from the backbone network is passed into the parallel axial receptive field module (PA_RFB) to extract the global dependency representation without losing the details. Third, a global map is generated based on this combined feature as the initial boot area of the subsequent components. Finally, the RA module is used to mine the target region and boundary clues to improve the segmentation accuracy. To verify the effectiveness and lightweight performance of the algorithm, five challenging datasets, including CVC‐ColonDB, CVC‐300, and Kvasir, are used in this paper. In six indexes, including MeanDice, MeanIoU, and MAE, compared with seven typical models such as PraNet and TransUnet, accuracy, FLOPs, parameters, and FPS were compared. The experimental results show that the MobileRaNet proposed in this paper has improved the performance of the five datasets to varying degrees, especially the MeanDice and MeanIOU indexes of the Kvasir dataset reach 91.2% and 85.6%, which are, respectively, increased by 1.4% and 1.6% compared with PraNet. Compared with PraNet, FLOPs and parameters decreased by 83.3% and 76.7%, respectively.
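For readers unfamiliar with the indexes named in this abstract, the following is a minimal sketch of how Dice, IoU, and MAE are commonly computed for a single binary segmentation mask. It is not taken from the MobileRaNet paper: the function name `dice_iou_mae`, the flat-list mask representation, and the smoothing term `eps` are assumptions made for illustration, and the paper's "MeanDice" and "MeanIoU" would additionally average such per-image values over a dataset.

```python
def dice_iou_mae(pred, gt, eps=1e-8):
    """Per-image Dice, IoU and MAE for binary masks.

    pred, gt: equal-length flat lists of 0/1 pixel labels (hypothetical
    representation chosen for this sketch).
    eps: small smoothing constant (an assumption) that avoids division
    by zero when both masks are empty.
    """
    if len(pred) != len(gt) or not pred:
        raise ValueError("masks must be non-empty and of equal length")

    # Overlap and areas of the predicted and ground-truth foreground.
    inter = sum(p * g for p, g in zip(pred, gt))
    pred_area = sum(pred)
    gt_area = sum(gt)
    union = pred_area + gt_area - inter

    dice = (2 * inter + eps) / (pred_area + gt_area + eps)
    iou = (inter + eps) / (union + eps)
    # Mean absolute error over all pixels.
    mae = sum(abs(p - g) for p, g in zip(pred, gt)) / len(pred)
    return dice, iou, mae


# Toy 4-pixel example: one true-positive pixel, one false-positive pixel.
dice, iou, mae = dice_iou_mae([1, 1, 0, 0], [1, 0, 0, 0])
print(round(dice, 4), round(iou, 4), round(mae, 4))
```

In the toy example the overlap is one pixel, the two foreground areas are 2 and 1, and the union is 2, so Dice is about 2/3, IoU about 1/2, and MAE 1/4.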
