2023
DOI: 10.1016/j.eswa.2023.120698
|View full text |Cite
|
Sign up to set email alerts
|

Image captioning based on scene graphs: A survey

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 7 publications
(1 citation statement)
references
References 150 publications
0
1
0
Order By: Relevance
“…These solutions, whether they rely on transformers and attention mechanisms [6,7,8], or scene graphs as presented in [9], in which learning is supervised, or relying on beam search analysis or gated recurrent units (GRU) units, in which learning is unsupervised [10,11], generate one single sentence for each input image. Such models are trained on RGB image datasets [12,13].…”
Section: Sentence Captioningmentioning
confidence: 99%
“…These solutions, whether they rely on transformers and attention mechanisms [6,7,8], or scene graphs as presented in [9], in which learning is supervised, or relying on beam search analysis or gated recurrent units (GRU) units, in which learning is unsupervised [10,11], generate one single sentence for each input image. Such models are trained on RGB image datasets [12,13].…”
Section: Sentence Captioningmentioning
confidence: 99%