2021
DOI: 10.1109/access.2021.3114871
|View full text |Cite
|
Sign up to set email alerts
|

3D Hand Gestures Segmentation and Optimized Classification Using Deep Learning

Abstract: Hand gestures recognition system has massive applications which are mainly utilized in robotics and computer vision specially to control Unmanned Aerial Vehicles (UAV). These methods bypass the presence of electronic control to UAVs and provide an ease to the operators. In this paper, we present a method for 3D hand gestures segmentation and classification by combining MASK-RCNN with Grass Hopper Optimization. We created a private 3D and RGB hand gestures dataset using Intel Kinetic and Intel Real sense d435i … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
4
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
4

Relationship

1
8

Authors

Journals

citations
Cited by 10 publications
(5 citation statements)
references
References 31 publications
0
4
0
Order By: Relevance
“…Vision-based marker-less hand tracking holds great promise for HCI applications, including sign language recognition, augmented reality, telesurgery, home automation, and gaming. However, its implementation faces several challenges, including tracking inaccuracy caused by complex articulated hand motion, high appearance variability, and demanding computational and real-time requirements [10][11][12][13][14]. Consequently, research in this field continues to be a challenging problem that has gained significant attention from the computer vision community.…”
Section: Related Work and Motivationmentioning
confidence: 99%
“…Vision-based marker-less hand tracking holds great promise for HCI applications, including sign language recognition, augmented reality, telesurgery, home automation, and gaming. However, its implementation faces several challenges, including tracking inaccuracy caused by complex articulated hand motion, high appearance variability, and demanding computational and real-time requirements [10][11][12][13][14]. Consequently, research in this field continues to be a challenging problem that has gained significant attention from the computer vision community.…”
Section: Related Work and Motivationmentioning
confidence: 99%
“…These signals are then classified using machine learning algorithms, which are trained with a large number of samples to achieve accurate recognition. Khan Fawad Salam et al [52] employed the MASK-RCNN combined with the Grass Hopper algorithm for classifying the obtained RGB images and hand keypoints. Jaya Prakash Sahoo et al [53] developed an end-to-end fine-tuning method for pre-trained CNN models using fractional-level fusion technology.…”
Section: Comparison and Analysismentioning
confidence: 99%
“…For example, researchers leverage PoseNet networks to delineate hand poses through feature extraction and reverse inference of hand key points [18][19][20], thereby enhancing accuracy and robustness [21,22]. Additionally, approaches utilizing deep learning networks like ResNet-50 process feature maps to output heat maps of hand target joints, yielding commendable results in multi-target recognition [23]. Some methodologies employ two-stage networks, initially predicting hands through hand masks and subsequently performing pose estimation, mutually refining to precisely locate hand key points and bolster robustness amidst complex backgrounds [24].…”
Section: Introductionmentioning
confidence: 99%