2024
DOI: 10.1609/aaai.v38i4.28183

Set Prediction Guided by Semantic Concepts for Diverse Video Captioning

Yifan Lu, Ziqi Zhang, Chunfeng Yuan, et al.

Abstract: Diverse video captioning aims to generate a set of sentences to describe the given video in various aspects. Mainstream methods are trained with independent pairs of a video and a caption from its ground-truth set without exploiting the intra-set relationship, resulting in low diversity of generated captions. Different from them, we formulate diverse captioning into a semantic-concept-guided set prediction (SCG-SP) problem by fitting the predicted caption set to the ground-truth set, where the set-level relati…

Cited by 0 publications
References 34 publications (49 reference statements)