2023
DOI: 10.48550/arxiv.2301.04742
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

HADA: A Graph-based Amalgamation Framework in Image-text Retrieval

Abstract: Many models have been proposed for vision and language tasks, especially the image-text retrieval task. All state-of-the-art (SOTA) models in this challenge contained hundreds of millions of parameters. They also were pretrained on a large external dataset that has been proven to make a big improvement in overall performance. It is not easy to propose a new model with a novel architecture and intensively train it on a massive dataset with many GPUs to surpass many SOTA models, which are already available to us… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 30 publications
(71 reference statements)
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?