Proceedings of the 29th ACM International Conference on Multimedia 2021
DOI: 10.1145/3474085.3475263
|View full text |Cite
|
Sign up to set email alerts
|

Video Visual Relation Detection via Iterative Inference

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
47
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 24 publications
(47 citation statements)
references
References 43 publications
0
47
0
Order By: Relevance
“…For each video in VidVRD dataset, the model needs to predict a set of relation instances, and each relation instance contains a relation triplet with the subject and object trajectories. Following [29,28], we use two evaluation protocols on this dataset: relation detection and relation tagging. For relation detection, we count a predicted relation instance as a correct one, if its relation triplet is the same with a ground truth, and their trajectory vIoU (volume IoU) of the subject and object are both larger than the threshold of 0.5.…”
Section: Methodsmentioning
confidence: 99%
See 4 more Smart Citations
“…For each video in VidVRD dataset, the model needs to predict a set of relation instances, and each relation instance contains a relation triplet with the subject and object trajectories. Following [29,28], we use two evaluation protocols on this dataset: relation detection and relation tagging. For relation detection, we count a predicted relation instance as a correct one, if its relation triplet is the same with a ground truth, and their trajectory vIoU (volume IoU) of the subject and object are both larger than the threshold of 0.5.…”
Section: Methodsmentioning
confidence: 99%
“…For relation detection, we count a predicted relation instance as a correct one, if its relation triplet is the same with a ground truth, and their trajectory vIoU (volume IoU) of the subject and object are both larger than the threshold of 0.5. In the same way as [29,28], we adopt Mean Average Precision (mAP), Recall@50 (R@50) and Recall@100 (R@100) to evaluate the model performance on relation detection. While in relation tagging, for a predicted relation instance, following [29,28] we only consider the correctness of its relation triplet, and ignore the precision of its subject and object trajectories.…”
Section: Methodsmentioning
confidence: 99%
See 3 more Smart Citations