2020
DOI: 10.4218/etrij.2019-0230
|View full text |Cite
|
Sign up to set email alerts
|

Three‐stream network with context convolution module for human–object interaction detection

Abstract: Human–object interaction (HOI) detection is a popular computer vision task that detects interactions between humans and objects. This task can be useful in many applications that require a deeper understanding of semantic scenes. Current HOI detection networks typically consist of a feature extractor followed by detection layers comprising small filters (eg, 1 × 1 or 3 × 3). Although small filters can capture local spatial features with a few parameters, they fail to capture larger context information relevant… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
2
2

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(1 citation statement)
references
References 23 publications
0
1
0
Order By: Relevance
“…The FBPN is combined with faster RCNN, which is modified by adopting a focal loss function to reduce the imbalance between complex and easy samples to promote detection of small objects. Other HBB implementations include the detection of vehicle number plates [21] and human–object interactions [22].…”
Section: Related Workmentioning
confidence: 99%
“…The FBPN is combined with faster RCNN, which is modified by adopting a focal loss function to reduce the imbalance between complex and easy samples to promote detection of small objects. Other HBB implementations include the detection of vehicle number plates [21] and human–object interactions [22].…”
Section: Related Workmentioning
confidence: 99%