Proceedings of the 29th ACM International Conference on Multimedia 2021
DOI: 10.1145/3474085.3475472
|View full text |Cite
|
Sign up to set email alerts
|

Counterfactual Debiasing Inference for Compositional Action Recognition

Abstract: Compositional action recognition is a novel challenge in the computer vision community and focuses on revealing the different combinations of verbs and nouns instead of treating subject-object interactions in videos as individual instances only. Existing methods tackle this challenging task by simply ignoring appearance information or fusing object appearances with dynamic instance tracklets. However, those strategies usually do not perform well for unseen action instances. For that, in this work we propose a … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
3
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 10 publications
(3 citation statements)
references
References 39 publications
0
3
0
Order By: Relevance
“…To verify the validity of the results, we compare them with the I3D [1] , STIN [6] , SAFCAR [13] , and CDF [9] models. Our model can be realized based on existing benchmark models, so SAFCAR and Interactive Fusion are used as benchmark models.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…To verify the validity of the results, we compare them with the I3D [1] , STIN [6] , SAFCAR [13] , and CDF [9] models. Our model can be realized based on existing benchmark models, so SAFCAR and Interactive Fusion are used as benchmark models.…”
Section: Resultsmentioning
confidence: 99%
“…But this cannot avoid the relationship between the object category of the labeled frame and the action. Sun et al [9] proposed to use the counterfactual algorithm to solve the problem of the visual information brought about by the representation role. Our present work, on the other hand, pays more attention to some irrelevant background visual information as well as the interference of action recognition caused by irrelevant actions, so that the model learns the motion patterns in the core action data.…”
Section: Combined Action Recognitionmentioning
confidence: 99%
“…It noted that fertilizers consumption has a statistically significant correlation with all variables except food price inflation. As the evidence of highly correlated variables can affect the precise estimation of regression which might lead to bias inference [61], the study achieves a multi-collinearity protection omission or inclusion of explanatory variables within the study model. The tested values of VIF are lower than 2 and the results of mean VIF are less than 5 (1.09), beside the degree of tolerance for explanatory variables is greater than 0.5 (Table 4) suggesting that there is no multicollinearity between the explanatory variables, so the population, fertilizers consumption and food prices inflation are included in the study models as social, agri-environmental, and economic keys challenges are affected the food security indicators in GCC countries.…”
mentioning
confidence: 99%