2022
DOI: 10.3390/jimaging8070192
|View full text |Cite
|
Sign up to set email alerts
|

Sign and Human Action Detection Using Deep Learning

Abstract: Human beings usually rely on communication to express their feeling and ideas and to solve disputes among themselves. A major component required for effective communication is language. Language can occur in different forms, including written symbols, gestures, and vocalizations. It is usually essential for all of the communicating parties to be fully conversant with a common language. However, to date this has not been the case between speech-impaired people who use sign language and people who use spoken lan… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
8
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
4

Relationship

0
10

Authors

Journals

citations
Cited by 18 publications
(8 citation statements)
references
References 44 publications
(80 reference statements)
0
8
0
Order By: Relevance
“…In Table 3 and Figure 9, the overall results of the SLR-AROSNN method are compared with recent approaches (Dhulipala et al, 2022). Based on accu y , the SLR-AROSNN of 98.77, 98.62, 95.42, 97.32, and 98.63%, correspondingly.…”
Section: Journal Of Disability Research 2023mentioning
confidence: 99%
“…In Table 3 and Figure 9, the overall results of the SLR-AROSNN method are compared with recent approaches (Dhulipala et al, 2022). Based on accu y , the SLR-AROSNN of 98.77, 98.62, 95.42, 97.32, and 98.63%, correspondingly.…”
Section: Journal Of Disability Research 2023mentioning
confidence: 99%
“…We'll connect each neuron to an input volume section. Convolutional layers form CNNs (Rao et al, 2023b), the layer's parameters are a set of learnable kernels with a small receptive field but full input volume (Liu et al, 2023;Dhulipala et al, 2022). In the forward pass, each filter convolved across the width and height of the input volume computes the dot product between its entries and the input and creates a 2-dimensional activation map shown in Figure 2.…”
Section: Convolution Layermentioning
confidence: 99%
“…CNN and LSTM employ vision-based data. The CNN model performed the best, achieving training and testing accuracies of 98.8% and 97.4%, whereas the LSTM model performed poorly, achieving training and testing accuracies of 49.4% and 48.8%, respectively [29]. This study presents the use of a four-beam patch antenna as a sensor node to evaluate the pillrolling effect in Parkinson's disease using the S-band sensing technique at 2.4 GHz.…”
Section: Introductionmentioning
confidence: 99%