Proceedings of the 3rd International Conference on Robotics, Control and Automation 2018
DOI: 10.1145/3265639.3265672
|View full text |Cite
|
Sign up to set email alerts
|

Action Recognition with 3D ConvNet-GRU Architecture

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
9
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
4
1

Relationship

0
10

Authors

Journals

citations
Cited by 14 publications
(9 citation statements)
references
References 28 publications
0
9
0
Order By: Relevance
“…In [13], a 2D CNN-LSTM architecture, where, in parallel with the LSTMs, it also uses a weakly supervised gloss-detection regularization network, consisting of stacked temporal 1D convolutions. A simpler variant of LSTMs, Gated-recurrent Units (GRU) [11], which consist of only two gates (update and reset gates), and have the internal state (output state) fully exposed, have also been used for temporal modelling [54].…”
Section: Image Appearance Based Methodsmentioning
confidence: 99%
“…In [13], a 2D CNN-LSTM architecture, where, in parallel with the LSTMs, it also uses a weakly supervised gloss-detection regularization network, consisting of stacked temporal 1D convolutions. A simpler variant of LSTMs, Gated-recurrent Units (GRU) [11], which consist of only two gates (update and reset gates), and have the internal state (output state) fully exposed, have also been used for temporal modelling [54].…”
Section: Image Appearance Based Methodsmentioning
confidence: 99%
“…When taking video sequences as as input, typical problems are the classification of actions or motion planning. Here DCNN networks are used for example the VGG [51] architecture [52]), as well as Self Organizing Maps network which in [53] receives information about the human's location and pose in a robot work space based on pressure activated notes in a safety mat.…”
Section: E Research Questionsmentioning
confidence: 99%
“…For action representation, the solutions that have achieved the most success are based upon optical flows [4], point clouds [5], convolutional neural networks (CNN) [6,7] and landmark detection of the main joints of the human body (i.e., skeleton-data) [8]. On the other hand, for action classification, previous attempts vary from random forests [9], to recurrent neural networks (RNN) [10,11] and more recently, graph neural networks (GNN).…”
Section: Introductionmentioning
confidence: 99%