2022
DOI: 10.3390/s22030706
|View full text |Cite
|
Sign up to set email alerts
|

Real-Time Hand Gesture Recognition Using Fine-Tuned Convolutional Neural Network

Abstract: Hand gesture recognition is one of the most effective modes of interaction between humans and computers due to being highly flexible and user-friendly. A real-time hand gesture recognition system should aim to develop a user-independent interface with high recognition performance. Nowadays, convolutional neural networks (CNNs) show high recognition rates in image classification problems. Due to the unavailability of large labeled image samples in static hand gesture images, it is a challenging task to train de… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
33
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 76 publications
(47 citation statements)
references
References 34 publications
0
33
0
Order By: Relevance
“…This study focuses on a literature review of hand gesture strategies and discusses their pros and limits in various situations. In addition, the performance of these methods is tabulated, with an emphasis on computer vision techniques that deal with similarity and difference points; hand segmentation techniques; classification algorithms and limitations; number and types of gestures; dataset used; detection range (distance); and camera type [ 51 ]. Convolutional neural networks (CNN) are used to categorize images of hand gestures.…”
Section: Significant Research Work On Hand Gesture Recognitionmentioning
confidence: 99%
“…This study focuses on a literature review of hand gesture strategies and discusses their pros and limits in various situations. In addition, the performance of these methods is tabulated, with an emphasis on computer vision techniques that deal with similarity and difference points; hand segmentation techniques; classification algorithms and limitations; number and types of gestures; dataset used; detection range (distance); and camera type [ 51 ]. Convolutional neural networks (CNN) are used to categorize images of hand gestures.…”
Section: Significant Research Work On Hand Gesture Recognitionmentioning
confidence: 99%
“…The vision-based group consists of 2D and 3D approaches, as shown in Table 1. The 2D method [21][22][23][24][25][26][27][28][29][30][31][32][33] uses video and image data for sign language interpretation. For example, a frequency-based model [21,22] has been proposed which uses a histogram, discrete wavelet transform, and a Gabor filter for extracting features from 2D images.…”
Section: Related Workmentioning
confidence: 99%
“…Subsequently, the motion-based model [25] was proposed, which uses the motion of the hand as a feature via hand trajectory tracking. Finally, the deep-based model [26][27][28][29][30][31][32][33][34][35] has been proposed, which uses a novel deep-learning-based architecture to learn spatial hierarchies of features automatically and adaptively for 2D images, such as the convolutional neural network model (CNN), the convolutional neural network-long short-term memory model (CNN-LSTM), and recurrent neural networks (RNN). However, the 2D approach has less information than the 3D approach, which makes it hard to solve the problem of similar gestures based on a backhand view.…”
Section: Related Workmentioning
confidence: 99%
“…There are various ways to calculate positional encodings, which can be obtained by learning or being fixed directly [39]. In this work, we use fixed positional encoding based on sine and cosine functions of different frequencies as shown in Equation ( 5) and ( 6): PE(pos, 2i) = sin pos 10, 000 2i/d ( 5) PE(pos, 2i + 1) = cos pos 10, 000 2i/d (6) where PE() denotes the function of positional encodings, pos denotes the position, i represents the dimension in the positional encoding vector, and d is a base parameter of transformer that denotes the size of the hidden layer at each position. We do this via positional encoding.…”
Section: Attention-based Encodermentioning
confidence: 99%
“…Compared to other physical input devices, hand gestures provide a more natural and convenient way for humans to interact with devices. Computer vision-based gesture recognition is a form of technology that uses a computing module to read and interpret hand movements as commands [1][2][3][4][5][6]. Computer visionbased gesture recognition technology can be used in several industries, such as interactive entertainment, smart home, VR/AR, and sign language machine translation [7][8][9][10][11][12][13].…”
Section: Introductionmentioning
confidence: 99%