2021
DOI: 10.48550/arxiv.2104.00240
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Self-supervised Motion Learning from Static Images

Abstract: Motions are reflected in videos as the movement of pixels, and actions are essentially patterns of inconsistent motions between the foreground and the background. To well distinguish the actions, especially those with complicated spatio-temporal interactions, correctly locating the prominent motion areas is of crucial importance. However, most motion information in existing videos are difficult to label and training a model with good motion representations with supervision will thus require a large amount of h… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
5

Relationship

5
0

Authors

Journals

citations
Cited by 5 publications
(5 citation statements)
references
References 54 publications
(87 reference statements)
0
5
0
Order By: Relevance
“…Transfer learning is an important measure to improve the generalization ability of the model. Supervised training [24,26,8,29,25,11] as well as unsupervised ones [15,13,21] fore, we adopt the former strategy. Recently, Transformer-Based methods have shown great potential in image recognition [10,33] and video understanding [1,3].…”
Section: Pre-train Of Classification Modelsmentioning
confidence: 99%
“…Transfer learning is an important measure to improve the generalization ability of the model. Supervised training [24,26,8,29,25,11] as well as unsupervised ones [15,13,21] fore, we adopt the former strategy. Recently, Transformer-Based methods have shown great potential in image recognition [10,33] and video understanding [1,3].…”
Section: Pre-train Of Classification Modelsmentioning
confidence: 99%
“…There are multiple ways to prepare the pre-trained model, such as supervised pre-training [17,7,1,4] as is used in [14,13,18] as well as unsupervised ones [10,9]. Here we adopt the supervised pre-training as it yields a better downstream performance.…”
Section: Initialization Preparationmentioning
confidence: 99%
“…The existing mainstream pre-training methods can be divided into two types: supervised [21, 9, 1, 8] and unsupervised [13,11]. Supervised methods can achieve stronger performance, but need to provide labels for each video.…”
Section: Training Backbonesmentioning
confidence: 99%