2022
DOI: 10.48550/arxiv.2210.16031
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

UPainting: Unified Text-to-Image Diffusion Generation with Cross-modal Guidance

Abstract: An epic dreamscape with fantasy architecture, vivid colors, wide angle, super highly detailed, professional digital painting Legendary elegant gnome hold map and feel confuse in forest, highly detailed, global illumination, ray tracing, sharp focus Beautiful village around an ancient dragon head, massive scale, realistic concept art, cinematic color scheme, dramatic lighting (a) Samples of complex scene image generation (b) Samples of simple scene image generation Beautiful robot female with closed eyes, sci-f… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
11
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(11 citation statements)
references
References 19 publications
0
11
0
Order By: Relevance
“…There are also other metrics for text-to-image evaluation, including Inception score (IS) [85] for image quality and R-precision for text-to-image generation [9]. [12] 27.10 LAFITE [60] 26.94 DALLE [11] 17.89 GLIDE [15] 12.24 Imagen [16] 7.27 Stable Diffusion [17] 12.63 VQ-Diffussion [43] 13.86 DALL-E 2 [18] 10.39 Upainting [63] 8.34 ERNIE-ViLG 2.0 [64] 6.75 eDiff-I [65] 6.95…”
Section: Technical Evaluation Of Text-to-image Methodsmentioning
confidence: 99%
See 4 more Smart Citations
“…There are also other metrics for text-to-image evaluation, including Inception score (IS) [85] for image quality and R-precision for text-to-image generation [9]. [12] 27.10 LAFITE [60] 26.94 DALLE [11] 17.89 GLIDE [15] 12.24 Imagen [16] 7.27 Stable Diffusion [17] 12.63 VQ-Diffussion [43] 13.86 DALL-E 2 [18] 10.39 Upainting [63] 8.34 ERNIE-ViLG 2.0 [64] 6.75 eDiff-I [65] 6.95…”
Section: Technical Evaluation Of Text-to-image Methodsmentioning
confidence: 99%
“…Recent evaluation benchmarks. Apart from the automatic metrics discussed above, multiple works involve human evaluation and propose their new evaluation benchmarks [14], [16], [63], [73], [80], [86], [87]. We summarize representative benchmarks in Table 2.…”
Section: Technical Evaluation Of Text-to-image Methodsmentioning
confidence: 99%
See 3 more Smart Citations