ModDrop: Adaptive Multi-Modal Gesture Recognition

Neverova, Natalia; Wolf, Christian; Taylor, Graham W.; Nebout, Florian

doi:10.1109/tpami.2015.2461544

Cited by 288 publications

(244 citation statements)

References 55 publications

Supporting

Mentioning

228

Contrasting

Unclassified

Order By: Relevance

“…Inspired by the recent progress in the field of deep learning 2D Convolutional Neural Networks (2D CNNs) have been applied to the Gesture Recognition field in order to extract spatial features [7], [8]. In recent studies, features were either concatenated into fixed sized gesture templates [7] or passed to HMM [13] or Recurrent Neural Networks [8] in order to model the temporal aspects of the gestures.…”

Section: D Convolutional Neural Networkmentioning

confidence: 99%

“…In recent studies, features were either concatenated into fixed sized gesture templates [7] or passed to HMM [13] or Recurrent Neural Networks [8] in order to model the temporal aspects of the gestures.…”

Section: D Convolutional Neural Networkmentioning

confidence: 99%

“…With the emergence of consumer depth cameras [5], researchers quickly incorporated depth sensors into their systems, as depth simplifies the task of human pose estimation [6]. Many state-of-the-art gesture recognition systems today use depth images as a modality or as a means of preprocessing their data before recognizing gestures [2], [7], [8].…”

Section: Introductionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

“…In recent years, Convolutional Neural Network (CNN) based approaches have achieved state-of-the-art performance in gesture recognition challenges [7], [8]. In [7], Neverova et al proposed a multi-scale and multi-modal deep learning architecture to spot and recognize continuous gestures, and achieved state-of-the-art performance in the ChaLearn 2014 Gesture Recognition challenge [22]. In [8], Pigou et al proposed temporally modeling the spatial features obtained from CNNs by using Recurrent Neural Networks (RNNs) with Long Short-Term Memory units, and shows the benefits of using RNNs over temporal pooling approaches.…”

Section: Introductionmentioning

confidence: 99%

See 4 more Smart Citations

Using Convolutional 3D Neural Networks for User-independent continuous gesture recognition

Camgöz

Hadfield

Koller

et al. 2016

2016 23rd International Conference on Pattern Recognition (ICPR)

View full text Add to dashboard Cite

Abstract-In this paper, we propose using 3D Convolutional Neural Networks for large scale user-independent continuous gesture recognition. We have trained an end-to-end deep network for continuous gesture recognition (jointly learning both the feature representation and the classifier). The network performs three-dimensional (i.e. space-time) convolutions to extract features related to both the appearance and motion from volumes of color frames. Space-time invariance of the extracted features is encoded via pooling layers. The earlier stages of the network are partially initialized using the work of Tran et al. before being adapted to the task of gesture recognition. An earlier version of the proposed method, which was trained for 11,250 iterations, was submitted to ChaLearn 2016 Continuous Gesture Recognition Challenge and ranked 2nd with the Mean Jaccard Index Score of 0.269235. When the proposed method was further trained for 28,750 iterations, it achieved state-of-the-art performance on the same dataset, yielding a 0.314779 Mean Jaccard Index Score.

show abstract

Section: D Convolutional Neural Networkmentioning

confidence: 99%

Section: D Convolutional Neural Networkmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 3 more Smart Citations

Using Convolutional 3D Neural Networks for User-independent continuous gesture recognition

Camgöz

Hadfield

Koller

et al. 2016

2016 23rd International Conference on Pattern Recognition (ICPR)

View full text Add to dashboard Cite

show abstract

Angle based hand gesture recognition using graph convolutional network

Aiman,

Ahmad

2023

Computer Animation & Virtual

View full text Add to dashboard Cite

Hand gesture recognition has attracted huge interest in the areas of autonomous driving, human computer systems, gaming and many others. Skeleton based techniques along with graph convolutional networks (GCNs) are being popularly used in this field due to the easy estimation of joint coordinates and better representation capability of graphs. Simple hand skeleton graphs are unable to capture the finer details and complex spatial features of hand gestures. To address these challenges, this work proposes an “angle‐based hand gesture graph convolutional network” (AHG‐GCN). This model introduces two additional types of novel edges in the graph to connect the wrist with each fingertip and finger's base, explicitly capturing their relationship, which plays an important role in differentiating gestures. Besides, novel features for each skeleton joint are designed using the angles formed with fingertip/finger‐base joints and the distance among them to extract semantic correlation and tackle the overfitting problem. Thus, an enhanced set of 25 features for each joint is obtained using these novel techniques. The proposed model achieves 90% and 88% accuracy for 14 and 28 gesture configurations for the DHG 14/28 dataset and, 94.05% and 89.4% accuracy for 14 and 28 gesture configurations for the SHREC 2017 dataset, respectively.

show abstract

Literature review of vision‐based dynamic gesture recognition using deep learning techniques

Jain

Karsh

Barbhuiya

2022

Concurrency and Computation

View full text Add to dashboard Cite

Summary Gesture recognition is the foremost need in building intelligent human‐computer interaction systems to solve many day‐to‐day problems and simplify human life in this digital world. The traditional machine learning (ML) algorithm tried to capture specific handcrafted features, failed miserably in some real‐world environments. Deep learning (DL) techniques have become a sensation among researchers in recent years, making the traditional ML approaches quite obsolete. However, existing reviews consider only a few datasets on which DL algorithm has been applied, and the categorization of the DL algorithms is vague in their review. This study provides the precise categorization of DL algorithms and considers around 15 gesture datasets on which these techniques have been applied. This study also provides a brief overview of the numerous challenging dataset available among the research community and insight into various challenges and limitations of a DL algorithm in vision‐based dynamic gesture recognition.

show abstract

ModDrop: Adaptive Multi-Modal Gesture Recognition

Cited by 288 publications

References 55 publications

Using Convolutional 3D Neural Networks for User-independent continuous gesture recognition

Using Convolutional 3D Neural Networks for User-independent continuous gesture recognition

Angle based hand gesture recognition using graph convolutional network

Literature review of vision‐based dynamic gesture recognition using deep learning techniques

Contact Info

Product

Resources

About