2020
DOI: 10.1007/s11042-020-09048-5
|View full text |Cite
|
Sign up to set email alerts
|

Video-based isolated hand sign language recognition using a deep cascaded model

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
31
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 61 publications
(31 citation statements)
references
References 26 publications
0
31
0
Order By: Relevance
“…On the other hand, some studies use the combination of deep learning and traditional methods. Rastgoo et al [18] used some handcrafted features and 2D-CNNs to obtain spatial information.…”
Section: Related Workmentioning
confidence: 99%
“…On the other hand, some studies use the combination of deep learning and traditional methods. Rastgoo et al [18] used some handcrafted features and 2D-CNNs to obtain spatial information.…”
Section: Related Workmentioning
confidence: 99%
“…They obtained an accuracy of 92.60% when using only 2D-CNN, 97.30% when using only 3D-CNN, and 99.20% when using only the fusion model. Rastgoo et al [16] presented a deep-based model for effective hand sign recognition by training: first, the single shot detector (SSD) model for hand identification using annotated videos of five online sign dictionaries, and second, a combinational model with a CNN and different spatial features. They performed a comprehensive study of sequence learning utilizing various pre-train models, spatial features, and temporal-based models.…”
Section: Related Workmentioning
confidence: 99%
“…Isolated sign language recognition refers to the task of accurately detecting single sign gestures from videos and thus it is usually tackled similar to action and gesture recognition, as well as other types of video processing and classification tasks with the extraction and learning of highly discriminative features [ 63 , 64 , 65 ]. In the literature, a common approach to the task of isolated sign language recognition is the extraction of hand and mouth regions from the video sequences in an attempt to remove noisy backgrounds that can inhibit classification performance.…”
Section: Sign Language Recognitionmentioning
confidence: 99%
“…In an effort to derive more discriminative features, Rastgoo et al in [ 63 ], proposed a multi-stream SLR method that gets as input hand regions, 3D hand pose features and Extra Spatial Hand Relation features (i.e., orientation and slope of hands). These features were concatenated and fed to an LSTM layer to derive the sign class.…”
Section: Sign Language Recognitionmentioning
confidence: 99%