2019
DOI: 10.1609/aaai.v33i01.33018303
|View full text |Cite
|
Sign up to set email alerts
|

I Know the Relationships: Zero-Shot Action Recognition via Two-Stream Graph Convolutional Networks and Knowledge Graphs

Abstract: Recently, with the ever-growing action categories, zero-shot action recognition (ZSAR) has been achieved by automatically mining the underlying concepts (e.g., actions, attributes) in videos. However, most existing methods only exploit the visual cues of these concepts but ignore external knowledge information for modeling explicit relationships between them. In fact, humans have remarkable ability to transfer knowledge learned from familiar classes to recognize unfamiliar classes. To narrow the knowledge gap … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
80
1

Year Published

2020
2020
2022
2022

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 165 publications
(81 citation statements)
references
References 27 publications
0
80
1
Order By: Relevance
“…Table 1 presents Phrase Detection rel=1 rel=10 rel=70 rel=1 rel=10 rel=70 Recall at 50 100 50 100 50 100 50 100 50 100 50 100 VTransE [34] 19. 4 2) Adopting a higher order structured output space ( = 40) outperforms lower order one ( = 150) which verifies the effectiveness of the HSA module.…”
Section: Ablation Studymentioning
confidence: 76%
See 1 more Smart Citation
“…Table 1 presents Phrase Detection rel=1 rel=10 rel=70 rel=1 rel=10 rel=70 Recall at 50 100 50 100 50 100 50 100 50 100 50 100 VTransE [34] 19. 4 2) Adopting a higher order structured output space ( = 40) outperforms lower order one ( = 150) which verifies the effectiveness of the HSA module.…”
Section: Ablation Studymentioning
confidence: 76%
“…In this way, the correlations between known classes and unknown classes can help to transfer the knowledge learned from the training classes to the unknown test classes by mapping the embeddings to visual classifiers. Knowledge Graphs (KGs) effectively capture explicit relational knowledge about individual entities hence many methods [4,6,9,25,33] use KGs to learn the class correlations. In scene graph generation, the relation classes are correlated by object classes as in the knowledge graph and the structural information is vital for a well-defined output space.…”
Section: Related Workmentioning
confidence: 99%
“…Action recognition is the task of recognising the sequence of actions from the frames of a video. However, if the new actions are not available when training, Zero-shot learning can be a solution, such as in [45,107,124,149]. Zero-shot Style Transfer in an image is the problem of transferring the texture of source image to target image while the style is not pre-determined and it is arbitrary [151].…”
Section: Applicationsmentioning
confidence: 99%
“…But these methods usually ignore the temporal information of videos, which take significant advantages for visual understanding [32]. [5] proposed a zero-shot action recognition framework using both the visual clues and external knowledge to show relations between objects and actions, also applied selfattention to model the temporal information of videos. Transfer learning In many real-world applications, it is expensive to re-collect training data and re-model when task changes [33].…”
Section: Related Workmentioning
confidence: 99%