2021
DOI: 10.48550/arxiv.2112.07383
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Improving Human-Object Interaction Detection via Phrase Learning and Label Composition

Abstract: Human-Object Interaction (HOI) detection is a fundamental task in high-level human-centric scene understanding. We propose PhraseHOI, containing a HOI branch and a novel phrase branch, to leverage language prior and improve relation expression. Specifically, the phrase branch is supervised by semantic embeddings, whose ground truths are automatically converted from the original HOI annotations without extra human efforts. Meanwhile, a novel label composition method is proposed to deal with the long-tailed prob… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
2

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 20 publications
0
2
0
Order By: Relevance
“…Existing HOI detection methods can be roughly categorized into crop-based and crop-free fashions [18,19,20,21].…”
Section: Related Workmentioning
confidence: 99%
“…Existing HOI detection methods can be roughly categorized into crop-based and crop-free fashions [18,19,20,21].…”
Section: Related Workmentioning
confidence: 99%
“…Conventional HOI detectors are mainly divided into two folds, bottom-up and top-down. The bottomup pipelines [3,6,7,10,18,19,21,22,24,35,38,45] first detect all humans and objects and then associate the human-object pairs and infer their HOI types through an additional classifier. These methods are usually organized as a two-stage paradigm and worked on improving the second stage.…”
Section: Related Workmentioning
confidence: 99%