2021
DOI: 10.1109/tmm.2020.3025661
|View full text |Cite
|
Sign up to set email alerts
|

Explore Video Clip Order With Self-Supervised and Curriculum Learning for Video Applications

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
3
1

Relationship

1
7

Authors

Journals

citations
Cited by 13 publications
(7 citation statements)
references
References 49 publications
0
7
0
Order By: Relevance
“…possible snippet orders. Since the snippet order prediction is purely a proxy task of our TCG framework and our focus is the learning of 3D CNNs, we restrict the number of snippets of a video between 3 to 4 to alleviate the complexity of the order prediction task, inspired by the previous works [35,57,56]. The snippets are sampled uniformly from the video with the interval of p frames.…”
Section: Sample and Shufflementioning
confidence: 99%
“…possible snippet orders. Since the snippet order prediction is purely a proxy task of our TCG framework and our focus is the learning of 3D CNNs, we restrict the number of snippets of a video between 3 to 4 to alleviate the complexity of the order prediction task, inspired by the previous works [35,57,56]. The snippets are sampled uniformly from the video with the interval of p frames.…”
Section: Sample and Shufflementioning
confidence: 99%
“…3. First, we follow previous video representation learning works [48,47] and perform late feature fusion, where we extract features from the input shot clips and then perform a hierarchical fusion of features. The combined features are then passed to a classifier network to predict the order (see Fig.…”
Section: Shot Sequence Orderingmentioning
confidence: 99%
“…Such a human-like learning strategy considerably improves the model's capability. Curriculum learning has been widely used in the computer vision area [13,23,48]. With the continuous expansion of transformer in computer vision, we propose to use a curriculum learning strategy aiming at improving our Dual-GCN model's performance on image captioning task.…”
Section: Related Workmentioning
confidence: 99%