2024
DOI: 10.1007/s40747-023-01341-8
|View full text |Cite
|
Sign up to set email alerts
|

Exploring better image captioning with grid features

Jie Yan,
Yuxiang Xie,
Yanming Guo
et al.

Abstract: Nowadays, Artificial Intelligence Generated Content (AIGC) has shown promising prospects in both computer vision and natural language processing communities. Meanwhile, as an essential aspect of AIGC, image to captions has received much more attention. Recent vision-language research is developing from the bulky region visual representations based on object detectors toward more convenient and flexible grid ones. However, this kind of research typically concentrates on image understanding tasks like image clas… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
references
References 56 publications
(110 reference statements)
0
0
0
Order By: Relevance