2024
DOI: 10.1109/tnnls.2022.3174031
|View full text |Cite
|
Sign up to set email alerts
|

MEN: Mutual Enhancement Networks for Sign Language Recognition and Education

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
5
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
6
1
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 10 publications
(5 citation statements)
references
References 88 publications
0
5
0
Order By: Relevance
“…Various other appearance-based baselines have also been proposed in [15] including a) 2D CNN + Gated Recurrent Unit (GRU) and b) 3D-CNN claiming the best results obtained by the I3D network. In [29], a SLR and education system is proposed. This SLR system is built upon a spatiotemporal network for semantic category identification of a given sign video while the education system detects the failure mode of learners and guides them to sign correctly.…”
Section: Related Workmentioning
confidence: 99%
“…Various other appearance-based baselines have also been proposed in [15] including a) 2D CNN + Gated Recurrent Unit (GRU) and b) 3D-CNN claiming the best results obtained by the I3D network. In [29], a SLR and education system is proposed. This SLR system is built upon a spatiotemporal network for semantic category identification of a given sign video while the education system detects the failure mode of learners and guides them to sign correctly.…”
Section: Related Workmentioning
confidence: 99%
“…Another advantage of ReLU is that it helps to address the problem of vanishing gradients, which can occur when using other activation functions such as sigmoid or tanh. This is because ReLU does not saturate for positive input values, which means that it does not cause the gradients to become small, and thus, it does not hinder the learning process [17,18]. Besides, ReLU is an optimal choice for CNN architecture in Hand Sign Recognition due to its non-linearity and computational efficiency.…”
Section: Figure 2 Proposed Architecturementioning
confidence: 99%
“…The fusion of multiple modalities provided robustness against environmental variations and improved performance in real-world scenarios. R. Li et al explored the effectiveness of transfer learning in hand sign language recognition using pre-trained CNN models [8]. By fine-tuning CNN architectures pre-trained on large-scale image datasets, they achieved notable improvements in recognition accuracy, demonstrating the potential of transfer learning for this task.…”
Section: Introductionmentioning
confidence: 99%