2019 IEEE Winter Conference on Applications of Computer Vision (WACV) 2019
DOI: 10.1109/wacv.2019.00044
|View full text |Cite
|
Sign up to set email alerts
|

C4Synth: Cross-Caption Cycle-Consistent Text-to-Image Synthesis

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
14
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 24 publications
(14 citation statements)
references
References 4 publications
0
14
0
Order By: Relevance
“…Since common datasets often contain more than one caption per image, using multiple captions could provide additional information to better describe the whole scene. C4Synth [91] uses multiple captions by employing a crosscaption cycle consistency which ensures that a generated image is consistent with a set of semantically similar sentences. It operates sequentially by iterating over all captions and improves the image quality by distilling concepts from multiple captions [91].…”
Section: Multiple Captionsmentioning
confidence: 99%
See 3 more Smart Citations
“…Since common datasets often contain more than one caption per image, using multiple captions could provide additional information to better describe the whole scene. C4Synth [91] uses multiple captions by employing a crosscaption cycle consistency which ensures that a generated image is consistent with a set of semantically similar sentences. It operates sequentially by iterating over all captions and improves the image quality by distilling concepts from multiple captions [91].…”
Section: Multiple Captionsmentioning
confidence: 99%
“…C4Synth [91] uses multiple captions by employing a crosscaption cycle consistency which ensures that a generated image is consistent with a set of semantically similar sentences. It operates sequentially by iterating over all captions and improves the image quality by distilling concepts from multiple captions [91].…”
Section: Multiple Captionsmentioning
confidence: 99%
See 2 more Smart Citations
“…Text-guided Synthesis Pioneered by GAN-INT-CLS [36] and GAWWN [37], conditional generative adversarial networks (GANs) [12] have been the dominant framework for text-based image synthesis [19,23,32,42,46,50]. Recent work DALL-E [34] shows promising results with transformers [43] and discrete VAE [35] by leveraging web-scale data.…”
Section: Related Workmentioning
confidence: 99%