2021
DOI: 10.48550/arxiv.2102.01282
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Progressive Localization Networks for Language-based Moment Localization

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 45 publications
0
1
0
Order By: Relevance
“…In addition, 2D-TAN [32] enumerate all possible segments as proposal candidates and convert them into 2D feature map, then a temporal adjacent network is proposed to obtain multi-modal representation and encode the video context information. Following this, [19,20,35] design more complicated cross-modal reasoning strategies to learn the video-language semantic alignment from both coarse and finegrained granularities.…”
Section: Short-form Video Temporal Groundingmentioning
confidence: 99%
“…In addition, 2D-TAN [32] enumerate all possible segments as proposal candidates and convert them into 2D feature map, then a temporal adjacent network is proposed to obtain multi-modal representation and encode the video context information. Following this, [19,20,35] design more complicated cross-modal reasoning strategies to learn the video-language semantic alignment from both coarse and finegrained granularities.…”
Section: Short-form Video Temporal Groundingmentioning
confidence: 99%