2020
DOI: 10.1360/ssi-2019-0292
|View full text |Cite
|
Sign up to set email alerts
|

Cross-modal video moment retrieval based on visual-textual relationship alignment

Abstract: In recent years, increasing amounts of video resources have created a series of demands for fine retrieval of video moments, such as highlight moments in sports events and the recreation of specific video content. In this context, research on cross-modal video segment retrieval, which attempts to output a video moment that matches the input query text, is gradually emerging. Existing solutions primarily focus on global or local feature representation for query text and video moments. However, such solutions ig… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
3
2

Relationship

1
4

Authors

Journals

citations
Cited by 8 publications
references
References 44 publications
(79 reference statements)
0
0
0
Order By: Relevance