2020
DOI: 10.48550/arxiv.2011.10830
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Boundary-sensitive Pre-training for Temporal Localization in Videos

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
6
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
1

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(6 citation statements)
references
References 0 publications
0
6
0
Order By: Relevance
“…Very recent works [2,50] have exploited some of the aforementioned techniques for better pre-training of action localization models. For example, localization-tailored data augmentation and classification is adopted by [50].…”
Section: Video Encoders In Talmentioning
confidence: 99%
See 2 more Smart Citations
“…Very recent works [2,50] have exploited some of the aforementioned techniques for better pre-training of action localization models. For example, localization-tailored data augmentation and classification is adopted by [50].…”
Section: Video Encoders In Talmentioning
confidence: 99%
“…Very recent works [2,50] have exploited some of the aforementioned techniques for better pre-training of action localization models. For example, localization-tailored data augmentation and classification is adopted by [50]. However, these works introduce a large amount of extra video data and additional stream networks, both of which are expensive in terms of memory and computation.…”
Section: Video Encoders In Talmentioning
confidence: 99%
See 1 more Smart Citation
“…Intuitively, performing feature re-calibration for task-specific features is a way to tackle this problem. Instead of finetuning the feature extractor [2,8,48] with high time and computation cost, we explore to re-calibrate the features in a more efficient manner. In this work, our intuition is simple: the RGB and FLOW features contain modal-specific information (i.e., appearance and motion information) from different perspectives of the given data.…”
Section: Introductionmentioning
confidence: 99%
“…For temporal action detection task, we need to localize and classify the target actions simultaneously. Current mainstream approaches [10,20,19] are designed in a twostage pipeline, i.e., proposal generation and action classification, and have achieved remarkable performance. Therefore, we follow this paradigm to design the solution of this challenge.…”
Section: Introductionmentioning
confidence: 99%