Proceedings of the 28th ACM International Conference on Multimedia 2020
DOI: 10.1145/3394171.3416270
|View full text |Cite
|
Sign up to set email alerts
|

Dual Attention GANs for Semantic Image Synthesis

Abstract: We propose a novel edge guided generative adversarial network with contrastive learning (ECGAN) for the challenging semantic image synthesis task. Although considerable improvements have been achieved by the community in the recent period, the quality of synthesized images is far from satisfactory due to three largely unresolved challenges. 1) The semantic labels do not provide detailed structural information, making it challenging to synthesize local details and structures; 2) The widely adopted CNN operation… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
36
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
4
2

Relationship

2
8

Authors

Journals

citations
Cited by 65 publications
(36 citation statements)
references
References 84 publications
0
36
0
Order By: Relevance
“…After summing with the temporal embedding E t ∈ R T ×C×V , the resulting feature vector Z is sent to the AniFormer encoder. In our case, original batch normalization layers are replaced with Instance Normalization [29] layers to preserve the instance style [15,19,27,28]. AniFormer Encoder.…”
Section: Aniformer: Transformer-based Network For 3d Animationmentioning
confidence: 99%
“…After summing with the temporal embedding E t ∈ R T ×C×V , the resulting feature vector Z is sent to the AniFormer encoder. In our case, original batch normalization layers are replaced with Instance Normalization [29] layers to preserve the instance style [15,19,27,28]. AniFormer Encoder.…”
Section: Aniformer: Transformer-based Network For 3d Animationmentioning
confidence: 99%
“…Generative Adversarial Networks (GANs). Over the last few years, GANs [16] have been shown effectively in many image generation and translation tasks [18,24,37,[39][40][41][42][43][44]57]. For example, Isola et al [18] propose Pix2Pix adversarial learning framework for paired image generation.…”
Section: Related Workmentioning
confidence: 99%
“…Pix2pixHD [42] improves Pix2Pix by proposing coarseto-fine generator and discriminators. Subsequent meth-ods [32,27,39,46,37,51] further explore how to synthesize high quality images from semantic masks and achieve significant improvements. Besides using class-level semantic masks, some works also consider instance-level information for image synthesis, since the semantic mask itself does not provide sufficient information to synthesize instances especially in complex environments with multiple of them interacting with each other.…”
Section: Conditional Image Synthesismentioning
confidence: 99%