2020
DOI: 10.48550/arxiv.2007.10662
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Fine-Grained Image Captioning with Global-Local Discriminative Objective

Abstract: Significant progress has been made in recent years in image captioning, an active topic in the fields of vision and language. However, existing methods tend to yield overly general captions and consist of some of the most frequent words/phrases, resulting in inaccurate and indistinguishable descriptions (see Figure 1). This is primarily due to (i) the conservative characteristic of traditional training objectives that drives the model to generate correct but hardly discriminative captions for similar images an… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 69 publications
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?