2023
DOI: 10.1109/access.2022.3231130
|View full text |Cite
|
Sign up to set email alerts
|

SIGNFORMER: DeepVision Transformer for Sign Language Recognition

Abstract: Sign language is the most common form of communication for the deaf and dumb. To bridge the communication gap with such impaired people, normal people should be able to recognize signs. Therefore, it is necessary to introduce a sign language recognition system to assist such impaired people. This paper proposes the Transformer Encoder as a useful tool for sign language recognition. For the recognition of static Indian signs, the authors have implemented a vision transformer. To recognize static Indian sign lan… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
9
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
6
4

Relationship

2
8

Authors

Journals

citations
Cited by 38 publications
(9 citation statements)
references
References 20 publications
0
9
0
Order By: Relevance
“…The computational features that have been previously described are processed by a three-layer MLP [ 27 ] network to detect the liveness of fingerprints. For the sake of clarity, a description of the training procedure for the Dual Attention model is provided in the form of Algorithm 1 down below.…”
Section: Methodsmentioning
confidence: 99%
“…The computational features that have been previously described are processed by a three-layer MLP [ 27 ] network to detect the liveness of fingerprints. For the sake of clarity, a description of the training procedure for the Dual Attention model is provided in the form of Algorithm 1 down below.…”
Section: Methodsmentioning
confidence: 99%
“…This model has 3.8 million trainable parameters and an average inference time of 2.23 s, which is comparatively more than the proposed model. The vision transformer in [34] used eight layers of the encoder with four heads with seven million parameters, which is computationally expensive. The authors in [21] have used ensemble network architecture employing ResNet50 with an attention module with more training parameters and epochs.…”
Section: Time Complexity and Order Of The Proposed Methodsmentioning
confidence: 99%
“…Several methods have been proposed domestically and internationally, from traditional to deep learning-based. Deep learning (DL), a crucial segment of machine learning, has gained prominence in medical image analysis, driving the pursuit of artificial intelligence (AI) in medical imaging [15], [16]. It significantly contributes to computer vision and medical image analysis, using neural networks to aid specialists in diagnosis, reduce radiologists' workload, and enhance efficiency.…”
Section: Medical Research Heavily Relies On Medical Image Analysis a ...mentioning
confidence: 99%