2022
DOI: 10.48550/arxiv.2207.12795
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Static and Dynamic Concepts for Self-supervised Video Representation Learning

Abstract: In this paper, we propose a novel learning scheme for selfsupervised video representation learning. Motivated by how humans understand videos, we propose to first learn general visual concepts then attend to discriminative local areas for video understanding. Specifically, we utilize static frame and frame difference to help decouple static and dynamic concepts, and respectively align the concept distributions in latent space. We add diversity and fidelity regularizations to guarantee that we learn a compact s… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 64 publications
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?