2023
DOI: 10.1038/s41598-023-34341-2
|View full text |Cite
|
Sign up to set email alerts
|

Denoising diffusion probabilistic models for 3D medical image generation

Abstract: Recent advances in computer vision have shown promising results in image generation. Diffusion probabilistic models have generated realistic images from textual input, as demonstrated by DALL-E 2, Imagen, and Stable Diffusion. However, their use in medicine, where imaging data typically comprises three-dimensional volumes, has not been systematically evaluated. Synthetic images may play a crucial role in privacy-preserving artificial intelligence and can also be used to augment small datasets. We show that dif… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

1
15
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 43 publications
(18 citation statements)
references
References 29 publications
(15 reference statements)
1
15
0
Order By: Relevance
“…For our experiments, we chose six recent 3D models with a range of sizes, from 1.8 million (M) to 27M parameters. This included: the SwinT vision transformer with 27M parameters, as adapted for video analysis [13], the multi-instance neuroimage transformer (MiNiT) with 3.6M parameters, and the Neuroimage Transformer (NiT) with 2M parameters, recently adapted for neuroimage applications [20]. Further, we used a commonly used architecture in neuroimaging -the DenseNet121 convolution-al neural network (CNN) [9] with 11M parameters, and a scaled down version with 10 times fewer parameters, Tiny-DenseNet CNN [6] with 1.8M parameters.…”
Section: Vision Architecturesmentioning
confidence: 99%
“…For our experiments, we chose six recent 3D models with a range of sizes, from 1.8 million (M) to 27M parameters. This included: the SwinT vision transformer with 27M parameters, as adapted for video analysis [13], the multi-instance neuroimage transformer (MiNiT) with 3.6M parameters, and the Neuroimage Transformer (NiT) with 2M parameters, recently adapted for neuroimage applications [20]. Further, we used a commonly used architecture in neuroimaging -the DenseNet121 convolution-al neural network (CNN) [9] with 11M parameters, and a scaled down version with 10 times fewer parameters, Tiny-DenseNet CNN [6] with 1.8M parameters.…”
Section: Vision Architecturesmentioning
confidence: 99%
“…[23][24][25]29 Diffusion models have been used to generate magnetic resonance imaging (MRI) and computed tomography (CT) scans to augment the training datasets of deep learning models. 21,27,30 Despite advances in generative models, limitations persist in achieving synthetic biological images that look realistic, as assessed by rigorous metrics. [21][22][23][24][25][26][27][28][31][32][33] Issues such as the accurate representation of subtle or rare textures, cell arrangements, and tissue boundaries are areas of active research.…”
Section: Introductionmentioning
confidence: 99%
“…21,27,30 Despite advances in generative models, limitations persist in achieving synthetic biological images that look realistic, as assessed by rigorous metrics. [21][22][23][24][25][26][27][28][31][32][33] Issues such as the accurate representation of subtle or rare textures, cell arrangements, and tissue boundaries are areas of active research. 22,26 Here, we explore interpolation techniques, such as frame interpolation for large motion (FILM), to enhance the resolution of 3D biological images.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…While 2D- or 3D-patch-based approaches showcase significant utility for selected downstream tasks, particular downstream tasks such as hemodynamic modeling (Rundfeldt et al, 2024) or 3D classification demand realistic anatomical accuracy and spatial continuity from the synthesized vessel volumes. To tackle the research gap of 3D modeling, prior works have proposed 3D adaptations of generative adversarial networks (Hong et al, 2021; Mensing et al, 2022) and denoising diffusion probabilistic models for medical images (Khader et al, 2023).…”
Section: Introductionmentioning
confidence: 99%