2023
DOI: 10.1016/j.media.2022.102599
|View full text |Cite
|
Sign up to set email alerts
|

Volumetric memory network for interactive medical image segmentation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
7
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 57 publications
(16 citation statements)
references
References 34 publications
0
7
0
Order By: Relevance
“…ResUNet++ also uses a conditional random field (CRF) and test time augmentation (TTA) for better prediction efficiency. Along these lines, many studies have been conducted on medical image segmentation based on deep learning [ 40 , 41 , 42 , 43 , 44 , 45 ]. In the present work, the proposed structure is designed to improve a U-Net model by adding residual modules.…”
Section: Related Workmentioning
confidence: 99%
“…ResUNet++ also uses a conditional random field (CRF) and test time augmentation (TTA) for better prediction efficiency. Along these lines, many studies have been conducted on medical image segmentation based on deep learning [ 40 , 41 , 42 , 43 , 44 , 45 ]. In the present work, the proposed structure is designed to improve a U-Net model by adding residual modules.…”
Section: Related Workmentioning
confidence: 99%
“…In the image segmentation field, a lot of emerging technologies have been successively introduced for improvement in recent years, such as graph convolution [33], prototypebased classification [34], and the memory-augmented network [35], while the most commonly used network architecture is still the Encoder-Decoder structure. The encoder progressively enlarges receptive fields to capture sufficient object semantic information, and the decoder is used to recover the spatial size and detail of deeply encoded features for pixel-level predictions.…”
Section: Encoder-decoder Segmentation Modelmentioning
confidence: 99%
“…Long-range dependency modeling has been extensively studied in many fields, such as video segmentation [23] and image segmentation [24]. The self-attention module is one of the first to model pairwise long-range relations.…”
Section: Pairwise Long-range Dependency Modelingmentioning
confidence: 99%
“…In particular, in standard convolution, the dilation rate r = 1. As shown in Figure 3, we apply multi-scale dilation convolution with four branches with rates (r = 6,12,18,24). Each of them has padding = r and stride = 1 to maintain the resolution of the input feature map.…”
Section: Local Context Refinement Modulementioning
confidence: 99%