2023
DOI: 10.1145/3556537

A Survey on Video Moment Localization

Abstract: Video moment localization, also known as video moment retrieval, aims to find a target segment within a video that is described by a given natural language query. Going beyond temporal action localization, where the target actions are pre-defined, video moment retrieval can query arbitrary complex activities. In this survey paper, we aim to present a comprehensive review of existing video moment localization techniques, including supervised, weakly supervised, and unsupervised ones. We also review the datas…

Cited by 9 publications (2 citation statements)
References 114 publications
“…For activity detection, a tIoU threshold was considered when matching the predicted activities to human annotated ones, following the standard practice 41, 42. The tIoU between a predicted activity and a human annotated activity was computed as the intersection of the two events divided by their union.…”
Section: Methods (mentioning)
confidence: 99%
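
The tIoU computation described in the statement above can be sketched as follows. This is a minimal illustration, not the citing paper's code: it assumes each activity is a (start, end) pair in seconds, and the function names `temporal_iou` and `is_match` as well as the 0.5 default threshold are hypothetical choices, since the statement does not give the threshold value.

```python
def temporal_iou(pred, gt):
    """Temporal IoU of two segments given as (start, end) pairs.

    Assumption: start <= end for both segments, same time unit.
    """
    p_start, p_end = pred
    g_start, g_end = gt
    # Overlap length; clamped at zero when the segments are disjoint.
    intersection = max(0.0, min(p_end, g_end) - max(p_start, g_start))
    # Union length = sum of both lengths minus the overlap.
    union = (p_end - p_start) + (g_end - g_start) - intersection
    return intersection / union if union > 0 else 0.0


def is_match(pred, gt, threshold=0.5):
    """True if the prediction matches the annotation at the given tIoU.

    The 0.5 default is an illustrative assumption, not from the source.
    """
    return temporal_iou(pred, gt) >= threshold
```

For example, a prediction spanning 0 to 10 s against an annotation spanning 5 to 15 s overlaps for 5 s out of a 15 s union, so it would not count as a match under the assumed 0.5 threshold.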
“…The user then selects which videos to watch and discards the ones they don't like, resulting in wasted transmission resources. However, if not all the videos are transmitted, the user may 3 experience buffering or a decrease in video quality, which can significantly impact their viewing experience. This issue involves how to recommend videos to the user, whether to transmit all or part of the videos, and how to allocate video resources, among other challenges.…”
Section: Streaming (mentioning)
confidence: 99%