Toward a Vision-Based Intelligent System: A Stacked Encoded Deep Learning Framework for Sign Language Recognition

Islam, Muhammad; Aloraini, Mohammed; Aladhadh, Suliman; Habib, Shabana; Khan, Asma; Alabdulatif, Abdulatif; Alanazi, Turki M.

doi:10.3390/s23229068

Cited by 3 publications

(3 citation statements)

References 58 publications


“…The exceptional performance of the hand gesture recognition system is rooted in meticulous dataset preparation, encompassing a diverse array of lighting conditions and subjects, in conjunction with deploying the EfficientNet B3 model within the CNN framework [26]. The scalability and efficiency inherent to this model were instrumental in achieving a balanced and effective learning process, thereby facilitating the system's ability to recognize gestures with high accuracy under varying environmental conditions and across different individuals [27].…”

Section: Discussion (mentioning)

confidence: 99%

Deep Learning Enhanced Hand Gesture Recognition for Efficient Drone use in Agriculture

Srinil, Thongnim (2024)

IJACSA


The use of deep learning in unmanned aerial vehicles (UAVs), or drones, has greatly improved various technologies by making complex tasks easier, faster, and requiring less human help. This study looks into how artificial intelligence (AI) can be used in farming, especially through creating a system where drones can be controlled by hand gestures to support agricultural activities. By using a special type of AI called a Convolutional Neural Network (CNN) with an EfficientNet B3 model, this research developed a gesture recognition system. It was trained on 1,393 pictures of different hand signals taken under various light conditions and from three different people. The system was evaluated based on its training and testing performance, showing very high scores in terms of loss, accuracy, F1 score, and the Area Under the Curve (AUC), which means it can recognize gestures accurately and work well in different situations. This has big implications for farming, as it gives farmers an easy way to control drones for tasks like checking on crops and spraying them precisely, which also helps keep them safe. This study is an important step towards smarter farming practices. Moreover, the system's ability to perform well in different settings shows it could also be useful in other areas like construction, where drones need to operate precisely and flexibly.



“…The work in [22] proposed an LSTM-based network, further enriched by the determinantal point process. Since this groundbreaking work, LSTM has emerged as a cornerstone for VS, with frequently advanced techniques developing [25,33,34]. For instance, the work [33] introduced a novel loss to gauge the fidelity of predicted summaries in preserving original semantic information.…”

Section: Supervised and Unsupervised Video Summarization (mentioning)

confidence: 99%

“…Since this groundbreaking work, LSTM has emerged as a cornerstone for VS, with frequently advanced techniques developing [25,33,34]. For instance, the work [33] introduced a novel loss to gauge the fidelity of predicted summaries in preserving original semantic information. The study [35] devises VS as a temporal interest detection challenge addressed by the anticipated DSNet.…”

Section: Supervised and Unsupervised Video Summarization (mentioning)

confidence: 99%

Effective Video Summarization Using Channel Attention-Assisted Encoder–Decoder Framework

Alharbi, Habib, Albattah et al. (2024)

Symmetry


A significant number of cameras regularly generate massive amounts of data, demanding hardware, time, and labor resources to acquire, process, and monitor. Asymmetric frames within videos pose a challenge to automatic video summarization, making it difficult to capture key content. Developments in computer vision have accelerated the seamless capture and analysis of high-resolution video content. Video summarization (VS) has garnered considerable interest due to its ability to provide concise summaries of lengthy videos. The current literature mainly relies on a reduced set of representative features implemented using shallow sequential networks. Therefore, this work utilizes an optimal feature-assisted visual intelligence framework for representative feature selection and summarization. First, we perform an empirical analysis of several features and adopt a fine-tuned InceptionV3 backbone for feature extraction, deviating from conventional approaches. Second, our encoder–decoder module captures complex relationships with five convolutional blocks and two convolution transpose blocks. Third, we introduce a channel attention mechanism that models interrelations between channels and prioritizes essential patterns to refine features for final summary generation. Comprehensive experiments and ablation studies validate the framework's performance, which consistently surpasses state-of-the-art networks on two benchmark datasets (TVSum and SumMe).


Real-time Arabic avatar for deaf-mute communication enabled by deep learning sign language translation

Talaat, El-Shafai, Soliman et al. (2024)

Computers and Electrical Engineering

