2020 IEEE International Conference on Image Processing (ICIP) 2020
DOI: 10.1109/icip40778.2020.9190695
|View full text |Cite
|
Sign up to set email alerts
|

MT-UNET: A Novel U-Net Based Multi-Task Architecture For Visual Scene Understanding

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
6
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
2
2

Relationship

0
8

Authors

Journals

citations
Cited by 14 publications
(6 citation statements)
references
References 6 publications
0
6
0
Order By: Relevance
“…MDWF-Net is a CNN [22,23] capable of calculating water-fat images, R2* and ∆f after receiving multi-echo GRE acquisitions as input. The architecture of MDWF-Net is based on multi-task U-Net, which has been previously proposed in the literature for signal processing tasks [18,19]. This configuration consists of an encoderdecoder structure that translates the input to a reduceddimensions latent space of features, from which water-fat images, R2* and ∆f are separately decodified (Fig.…”
Section: Multi-decoder Water-fat Separation Networkmentioning
confidence: 99%
“…MDWF-Net is a CNN [22,23] capable of calculating water-fat images, R2* and ∆f after receiving multi-echo GRE acquisitions as input. The architecture of MDWF-Net is based on multi-task U-Net, which has been previously proposed in the literature for signal processing tasks [18,19]. This configuration consists of an encoderdecoder structure that translates the input to a reduceddimensions latent space of features, from which water-fat images, R2* and ∆f are separately decodified (Fig.…”
Section: Multi-decoder Water-fat Separation Networkmentioning
confidence: 99%
“…In channel attention, the contribution of each channel in a given feature map is weighted by aggregating features spatially. Jha et al [78], for instance, apply task-specific channel-attention modules in order to highlight relevant task-specific features. In spatial attention, on the other hand, features are aggregated channel-wise, in order to provide the attention values for each spatial position in a given feature map.…”
Section: B Local and Global Context Modelingmentioning
confidence: 99%
“…Another approach, TransFuse (Zhang et al, 2021), employed a parallel combination of Transformers and CNNs to improve the efficiency of capturing global information. (Jha et al, 2020), UNet-2022 (Guo et al, 2022a), nnUNet (Isensee et al, 2021) methods for (a) Automated cardiac diagnosis (Bernard et al, 2018) and (b) Skin lesion segmentation (Gutman et al, 2016;Barata et al, 2014). The computational complexity of each method is reflected in the FLOPs(G) (Floating Point Operations) metric, while the segmentation performance is measured by the DSC(%) (Dice Similarity Coefficient).…”
Section: Medical Image Segmentationmentioning
confidence: 99%