2022
DOI: 10.3389/fonc.2022.942511
|View full text |Cite
|
Sign up to set email alerts
|

Swin transformer-based GAN for multi-modal medical image translation

Abstract: Medical image-to-image translation is considered a new direction with many potential applications in the medical field. The medical image-to-image translation is dominated by two models, including supervised Pix2Pix and unsupervised cyclic-consistency generative adversarial network (GAN). However, existing methods still have two shortcomings: 1) the Pix2Pix requires paired and pixel-aligned images, which are difficult to acquire. Nevertheless, the optimum output of the cycle-consistency model may not be unique… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
11
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 27 publications
(11 citation statements)
references
References 35 publications
0
11
0
Order By: Relevance
“…To quantitatively evaluate the performance, we used mean absolute error (MAE), SSIM, and peak signal-to-noise ratio (PSNR) metrics. The performances of the models were compared between modified U-Net and other DL models 17,23–25 . Augmentation techniques such as scaling, rotating, and flipping were implemented during the training process to enhance the dataset.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…To quantitatively evaluate the performance, we used mean absolute error (MAE), SSIM, and peak signal-to-noise ratio (PSNR) metrics. The performances of the models were compared between modified U-Net and other DL models 17,23–25 . Augmentation techniques such as scaling, rotating, and flipping were implemented during the training process to enhance the dataset.…”
Section: Methodsmentioning
confidence: 99%
“…The performances of the models were compared between modified U-Net and other DL models. 17,[23][24][25] Augmentation techniques such as scaling, rotating, and flipping were implemented during the training process to enhance the dataset.…”
Section: Implementation Detailsmentioning
confidence: 99%
“…After preprocessing, deep learning models were proposed to segment lesions, which greatly improved future work efficiency. This work selected a deep learning model method based on the transformer architecture (Swin transformer) because of its superiority in multiple domains [ 15 , 16 ]. The Swin transformer adopts a hierarchical design containing a total of four stages: each stage decreases the resolution of the input feature map and expands the perceptual field layer by layer, similar to a convolutional neural network.…”
Section: Methodsmentioning
confidence: 99%
“…The transformer-based network architecture has demonstrated competitive performance in various generative models [45][46][47]. Inspired by this, we construct a score network named 'TransDiff ' , which use the swin transformer layer as the backbone, aiming to enhance its feature extraction capabilities and achieve self-attention from local to global.…”
Section: Sore Network 'Transdiff 'mentioning
confidence: 99%