2021
DOI: 10.48550/arxiv.2112.00390
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

SegDiff: Image Segmentation with Diffusion Probabilistic Models

Abstract: Diffusion Probabilistic Methods are employed for stateof-the-art image generation. In this work, we present a method for extending such models for performing image segmentation. The method learns end-to-end, without relying on a pre-trained backbone. The information in the input image and in the current estimation of the segmentation map is merged by summing the output of two encoders. Additional encoding layers and a decoder are then used to iteratively refine the segmentation map using a diffusion model. Sin… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
46
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
3
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 41 publications
(50 citation statements)
references
References 42 publications
1
46
0
Order By: Relevance
“…Nevertheless, these iterative algorithms are still considerably slower than GANs, so substantial work has been invested in improving their speed without compromising significantly on generation quality [258,135,247], often achieving impressive speedup levels. Diffusion models have since become ubiquitous in many applications [142,209,21,116,6,253,254,144], prompting researchers to prepare surveys of their impact on the image processing field and beyond [315,60,36]. Figure 8.1: Temporal steps along 3 independent synthesis paths of the Annealed Langevin Dynamics [260] algorithm, using a denoiser [261] trained on LSUN bedroom [319] images.…”
Section: Regularization By Denoising (Red)mentioning
confidence: 99%
“…Nevertheless, these iterative algorithms are still considerably slower than GANs, so substantial work has been invested in improving their speed without compromising significantly on generation quality [258,135,247], often achieving impressive speedup levels. Diffusion models have since become ubiquitous in many applications [142,209,21,116,6,253,254,144], prompting researchers to prepare surveys of their impact on the image processing field and beyond [315,60,36]. Figure 8.1: Temporal steps along 3 independent synthesis paths of the Annealed Langevin Dynamics [260] algorithm, using a denoiser [261] trained on LSUN bedroom [319] images.…”
Section: Regularization By Denoising (Red)mentioning
confidence: 99%
“…To date, diffusion models have been found to be useful in a wide variety of areas, ranging from generative modeling tasks such as image generation [23], image super-resolution [24], image inpainting [25] to discriminative tasks such as image segmentation [26], classification [27], and anomaly detection [28]. Recently, the medical imaging community has witnessed exponential growth in the number of diffusion-based techniques (see Figure 4).…”
Section: Denoising Diffusion Modelsmentioning
confidence: 99%
“…Existing generalist models Image Prediction Groundtruth are based on autoregressive models, while our work is based on Bit Diffusion [12,23,50,51]. Diffusion models have been applied to semantic segmentation, directly [1,27,63] or in-directly [2,4]. However none of these methods model segmentation masks as discrete/categorical tokens, nor are their models capable of video segmentation.…”
Section: Related Workmentioning
confidence: 99%