2021
DOI: 10.1016/j.neunet.2021.07.019
|View full text |Cite
|
Sign up to set email alerts
|

Adversarial text-to-image synthesis: A review

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
38
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
3
3
1

Relationship

0
7

Authors

Journals

citations
Cited by 124 publications
(55 citation statements)
references
References 102 publications
0
38
0
Order By: Relevance
“…These metrics do not take ground truth data into account and use a classifier pretrained on ImageNet [16] that mostly contains single-object images. Therefore, they are likely not well suited for more complex datasets [20]. To measure image-text alignment, metrics based on retrieval, captioning and object detection models have been proposed.…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations
“…These metrics do not take ground truth data into account and use a classifier pretrained on ImageNet [16] that mostly contains single-object images. Therefore, they are likely not well suited for more complex datasets [20]. To measure image-text alignment, metrics based on retrieval, captioning and object detection models have been proposed.…”
Section: Related Workmentioning
confidence: 99%
“…Semantic object accuracy (SOA) [27] measures whether an object detector can detect an object described in the text from a generated image. R-precision and image captioning based evaluation can fail when many different captions correctly describe the same image [20,27]. 4 SOA only focuses on the existence of objects, which makes it not well suited to evaluate object attributes and relation between objects [20,27].…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…Although much of the patient’s narrative may be told separately through text, imaging, and omics modalities,[ 63 ] there is tremendous potential to integrate semantic information contained in pathologist notes with imaging and omics modalities to capture a more holistic perspective of the patient’s health and integrate potentially useful information that could otherwise be overlooked. For instance, the semantic information contained in a report may highlight specific morphological and macro-architectural features in the correspondent biopsy specimen that an image-based deep learning model might struggle to identify without additional information.…”
Section: Discussionmentioning
confidence: 99%