2019
DOI: 10.1609/aaai.v33i01.3301898
|View full text |Cite
|
Sign up to set email alerts
|

Turbo Learning Framework for Human-Object Interactions Recognition and Human Pose Estimation

Abstract: Human-object interactions (HOI) recognition and pose estimation are two closely related tasks. Human pose is an essential cue for recognizing actions and localizing the interacted objects. Meanwhile, human action and their interacted objects' localizations provide guidance for pose estimation. In this paper, we propose a turbo learning framework to perform HOI recognition and pose estimation simultaneously. First, two modules are designed to enforce message passing between the tasks, i.e. pose aware HOI recogn… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
5
2
1

Relationship

2
6

Authors

Journals

citations
Cited by 9 publications
(5 citation statements)
references
References 6 publications
0
5
0
Order By: Relevance
“…In addition to the cropped instance features, previous methods leverage combined spatial features [3,12,10,14,9,16,46], union box features [34,39], or context features [10,40,30] to improve the accuracy of HOI detection. In order to concentrate on more interactionrelevant features, some methods utilize extra features, such as human pose [37,5,24,14], human parts [47,39,23] and language features [42,9,30,21]. However, the serial architectures of such two-stage methods impair the efficiency of HOI detection.…”
Section: Two-stage Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…In addition to the cropped instance features, previous methods leverage combined spatial features [3,12,10,14,9,16,46], union box features [34,39], or context features [10,40,30] to improve the accuracy of HOI detection. In order to concentrate on more interactionrelevant features, some methods utilize extra features, such as human pose [37,5,24,14], human parts [47,39,23] and language features [42,9,30,21]. However, the serial architectures of such two-stage methods impair the efficiency of HOI detection.…”
Section: Two-stage Methodsmentioning
confidence: 99%
“…Determining which regions to concentrate on is critical and challenging for HOI detectors. To obtain essential features for interaction prediction, conventional two-stage methods usually involve extra features, e.g., human pose [37,5,24,14] and language [42,9,30,21]. However, even with extra features, two-stage methods still focus on the detected instances that might be inaccurate, which are less adaptive and limited by the detected instances.…”
Section: Introductionmentioning
confidence: 99%
“…GPNN uses a message passing mechanism to reason upon graph structured information [23]. Feng et al proposed a turbo learning method which views human pose and HOI as complementary information to each other and optimize both tasks in an iterative manner [24]. Our proposed HRS explores the geometric relations and action relations between humans and entities.…”
Section: Vrd and Hoi-detmentioning
confidence: 99%
“…Current works pay more attention to exploring how to improve the second stage. The most recent works aim to understand HOI by capturing context information [6,26] or human structural message [25,5,4,32]. Some works [21,27,32] formulated the second stage as a graph reasoning problem and use graph convolutional network to predict the HOI.…”
Section: Related Workmentioning
confidence: 99%