2021
DOI: 10.1016/j.neucom.2020.12.029
|View full text |Cite
|
Sign up to set email alerts
|

3-D Relation Network for visual relation recognition in videos

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 19 publications
(5 citation statements)
references
References 12 publications
0
5
0
Order By: Relevance
“…• VRD-STGC [27], which proposes a novel slidingwindow scheme to simultaneously predict short-term and long-term relationships [27], and extracts spatiotemporal features. • 3DRN [65], which develops a 3-D CNN to learn the visual features for relation recognition in an end-to-end manner.…”
Section: Multi-expert Performancementioning
confidence: 99%
See 1 more Smart Citation
“…• VRD-STGC [27], which proposes a novel slidingwindow scheme to simultaneously predict short-term and long-term relationships [27], and extracts spatiotemporal features. • 3DRN [65], which develops a 3-D CNN to learn the visual features for relation recognition in an end-to-end manner.…”
Section: Multi-expert Performancementioning
confidence: 99%
“…Relation detection Relation tagging mAP R@50 R@100 P@1 P@5 P@10 3DRN [65] 2.47 2, but here on the VidOR dataset [62].…”
Section: Vidor Datasetmentioning
confidence: 99%
“…The metric-based meta-learning method is a non-parametric learning model, so its complexity is less than other methods. The idea is to learn the meta-knowledge of how to measure the similarity of samples between the support set and the query set from the embedding space by using feature embedding, such as matching network [23], relation network [24]. Generally, deep neural networks are used to map samples into the feature space, and cosine similarity [25] is used to measure the similarity of features, predict the category labels, calculate the loss and then back propagate to optimize the network.…”
Section: Preliminary Knowledgementioning
confidence: 99%
“…Snippet relation detection. Many before us have investigated relation detection in videos [5,11,25,30,34,38,39,42,43,44,46,53,59]. Relation in videos provide additional temporal information, important for interactions such as pushing or pulling a closed door.…”
Section: Related Workmentioning
confidence: 99%