2018
DOI: 10.1016/j.neucom.2018.04.015
|View full text |Cite
|
Sign up to set email alerts
|

A self-organizing neural network architecture for learning human-object interactions

Abstract: The visual recognition of transitive actions comprising human-object interactions is a key component for artificial systems operating in natural environments. This challenging task requires jointly the recognition of articulated body actions as well as the extraction of semantic elements from the scene such as the identity of the manipulated objects. In this paper, we present a self-organizing neural network for the recognition of human-object interactions from RGB-D videos. Our model consists of a hierarchy o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
22
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
4
3
1

Relationship

3
5

Authors

Journals

citations
Cited by 33 publications
(22 citation statements)
references
References 58 publications
(94 reference statements)
0
22
0
Order By: Relevance
“…Similar GWR-based approaches have been proposed for the incremental learning of body motion patterns (Mici et al 2017, Elfaramawy et al 2017, Parisi et al 2016) and human-object interaction (Mici et al 2018). However, these unsupervised learning approaches do not take into account top-down task-relevant signals that can regulate the stability-plasticity balance, potentially leading to scalability issues for large-scale datasets.…”
Section: Dynamic Architecturesmentioning
confidence: 99%
“…Similar GWR-based approaches have been proposed for the incremental learning of body motion patterns (Mici et al 2017, Elfaramawy et al 2017, Parisi et al 2016) and human-object interaction (Mici et al 2018). However, these unsupervised learning approaches do not take into account top-down task-relevant signals that can regulate the stability-plasticity balance, potentially leading to scalability issues for large-scale datasets.…”
Section: Dynamic Architecturesmentioning
confidence: 99%
“…The neural update rate decreases as the neurons become more habituated, which has the effect of preventing that noisy input interferes with consolidated neural representations. Similar GWR-based approaches have been proposed for the incremental learning of body motion patterns [54,8,61] and human-object interaction [55].…”
Section: Growing Self-organizing Networkmentioning
confidence: 99%
“…3D features when objects are small and partially occluded. Alternatively, other methods model human-object interactions considering the skeletal features combined with the object's identity (Rybok et al, 2014;Mici et al, 2018c).…”
Section: Single-layer Approachesmentioning
confidence: 99%
“…More formally, we propose a hierarchical arrangement of Growing When Required (GWR) networks (Marsland et al, 2002 ) which integrate multiple visual cues regarding the body pose, the manipulated object, and their spatial relation during human-object interaction, accumulated over a short and a longer period of time in order to jointly learn actions and human activities respectively. In our previous research, we have successfully applied and evaluated hierarchical architectures of the GWR network for clustering human body pose and motion patterns as well as for learning prototypical representations of human-object interactions in an unsupervised fashion (Parisi et al, 2015 ; Mici et al, 2018a , c ). The generative properties of the GWR networks have been shown to be particularly suitable for the human-object interaction recognition due to generalizing well to unseen action-object pairs.…”
Section: Introductionmentioning
confidence: 99%