2022
DOI: 10.3390/app12115523
|View full text |Cite
|
Sign up to set email alerts
|

Automatic Recognition of Mexican Sign Language Using a Depth Camera and Recurrent Neural Networks

Abstract: Automatic sign language recognition is a challenging task in machine learning and computer vision. Most works have focused on recognizing sign language using hand gestures only. However, body motion and facial gestures play an essential role in sign language interaction. Taking this into account, we introduce an automatic sign language recognition system based on multiple gestures, including hands, body, and face. We used a depth camera (OAK-D) to obtain the 3D coordinates of the motions and recurrent neural n… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
18
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 21 publications
(28 citation statements)
references
References 25 publications
0
18
0
Order By: Relevance
“…Amon them, Amorim et al developed 67 skeleton key points based on 20 classes of the ASLLVD dataset, which include word level sign language word recognition using the enhancement of [21] by adding graphic layout [22]. Solis et al collected 30 Mexican sign language (MSL) word skeletons considering body, face and hand information using a spatial camera, then applied RNN and LSTM and achieved good ac-curacy [14]. Xia et al made a dataset by considering 67 whole body key points and achieved satisfactory performance using RNN with their self-development dataset [23].…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations
“…Amon them, Amorim et al developed 67 skeleton key points based on 20 classes of the ASLLVD dataset, which include word level sign language word recognition using the enhancement of [21] by adding graphic layout [22]. Solis et al collected 30 Mexican sign language (MSL) word skeletons considering body, face and hand information using a spatial camera, then applied RNN and LSTM and achieved good ac-curacy [14]. Xia et al made a dataset by considering 67 whole body key points and achieved satisfactory performance using RNN with their self-development dataset [23].…”
Section: Related Workmentioning
confidence: 99%
“…This dataset was collected from 30 different MSL signs with 25 repetitions for each sign. They recorded 3000 samples for the dataset in total, specifically 20 videos for each sign and extracted 20 frames from each video [14].…”
Section: B Msl Datasetmentioning
confidence: 99%
See 2 more Smart Citations
“…Mejia-Perez et al assessed different architectures of recurrent neural networks (RNNs) for the recognition of 26 dynamic words and four static alphabet letters [20]. Four people performed each sign 25 times against a controlled background while wearing black clothes.…”
Section: Related Workmentioning
confidence: 99%