2023
DOI: 10.1109/access.2023.3247761
|View full text |Cite
|
Sign up to set email alerts
|

Signgraph: An Efficient and Accurate Pose-Based Graph Convolution Approach Toward Sign Language Recognition

Abstract: Sign language recognition (SLR) enables the deaf and speech-impaired community to integrate and communicate effectively with the rest of society. Word level or isolated SLR is a fundamental yet complex task with the main objective of using models to correctly recognize signed words. Sign language consists of very fast and complex hand, body, face movements, and mouthing cues that make the task very challenging. Several input modalities; RGB, optical Flow, RGB-D, and pose/skeleton have been proposed for SLR. Ho… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
9

Relationship

0
9

Authors

Journals

citations
Cited by 21 publications
(5 citation statements)
references
References 51 publications
0
2
0
Order By: Relevance
“…Because sign language is performed by multiple parts of the body, the nodes of the graph structure used in sign language recognition should respond to information from those parts. However, too many nodes do not provide additional useful information to the model, but instead introduce noise into the model, which affects the accuracy of the model [118,126]. Therefore, for skeleton-based SLR, it is important to choose the right nodes for model learning.…”
Section: C: Other Methodsmentioning
confidence: 99%
“…Because sign language is performed by multiple parts of the body, the nodes of the graph structure used in sign language recognition should respond to information from those parts. However, too many nodes do not provide additional useful information to the model, but instead introduce noise into the model, which affects the accuracy of the model [118,126]. Therefore, for skeleton-based SLR, it is important to choose the right nodes for model learning.…”
Section: C: Other Methodsmentioning
confidence: 99%
“…Also, the parameter count of this model is 5.92 M, which makes it time-consuming during training and inference. The architecture proposed in [1,2] employed a graph convolutional network (GCN) that has a time complexity of O (k × e × d + k × n × d 2 ), where the variables n, e, K, and d represent the total number of nodes, edges, layers, and dimensions of the node hidden features utilizing pose data as input.…”
Section: Computational Performance Analysismentioning
confidence: 99%
“…A lot of preprocessing is required to enhance the model's efficiency. These approaches could be more efficient in dynamic sign language gestures [1][2][3][4][5].…”
Section: Introductionmentioning
confidence: 99%
“…The WLASL dataset has the largest number of videos of American Sign Language hand gestures [73]. It has a total of 2000 hand gesture classes.…”
Section: Wlasl Datasetmentioning
confidence: 99%