2019
DOI: 10.1109/access.2019.2904749
|View full text |Cite
|
Sign up to set email alerts
|

Dynamic Sign Language Recognition Based on Video Sequence With BLSTM-3D Residual Networks

Abstract: Sign language recognition aims to recognize meaningful movements of hand gestures and is a significant solution in intelligent communication between the deaf community and hearing societies. However, until now, the current dynamic sign language recognition methods still have some drawbacks with difficulties of recognizing complex hand gestures, low recognition accuracy for most dynamic sign language recognition, and potential problems in larger video sequence data training. In order to solve these issues, this… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
50
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 131 publications
(59 citation statements)
references
References 46 publications
0
50
0
Order By: Relevance
“…The hidden state of the current clip is input into a softmax layer to estimate classconditional probabilities using connectionist temporal classification as a cost function. Liao et al [32] developed a deep 3-dimensional residual ConvNet and bi-directional LSTM networks for dynamic sign language recognition. Hand object was localized in the video frames using faster R-CNN, then a 3D ResNet jointly extracts spatial and temporal features from the input image sequences which classified using bidirectional LSTM.…”
Section: Related Workmentioning
confidence: 99%
“…The hidden state of the current clip is input into a softmax layer to estimate classconditional probabilities using connectionist temporal classification as a cost function. Liao et al [32] developed a deep 3-dimensional residual ConvNet and bi-directional LSTM networks for dynamic sign language recognition. Hand object was localized in the video frames using faster R-CNN, then a 3D ResNet jointly extracts spatial and temporal features from the input image sequences which classified using bidirectional LSTM.…”
Section: Related Workmentioning
confidence: 99%
“…As can be seen from Table 3, major Chinese Sign Language Recognition-related papers and their focus were listed, including conferences, journals, and workshops. These articles contain sensor-based and vision-based recognition methods, introducing some advanced and fashionable technologies such as SVM [23,24,39,40,101,102], DTW [103][104][105], HMM [20,22,23,101,[106][107][108][109][110][111][112][113][114], LSTM [115], ANN [116], CNN (3D-CNN) [70,71,117,118], HOD, HOG [23,24,103], and PCA [20]. Hidden Markov model (HMM) is a general processing method of Sign Language Recognition.…”
Section: Investigation Of Chinese Sign Language Recognitionmentioning
confidence: 99%
“…Validation on a private Chinese Sign Language vocabulary shows it has superiority to traditional HMM method. An advanced dynamic SLR method called BLSTM-3D ResNet was presented by Liao et al [115], which included a deep three-dimensional residual convolutional network and bidirectional LSTM networks. They localized the hand object from video frames and extracted spatial-temporal features by BLSTM-3D ResNet.…”
Section: Investigation Of Chinese Sign Language Recognitionmentioning
confidence: 99%
See 1 more Smart Citation
“…In recent years, many researchers shift their attention from the traditional methods to convolutional neural networks (CNNs) [13][14][15] since they have achieved remarkable success in many important tasks of computer vision, such as classification, detection, and recognition. Lots of approaches have been proposed to solve the problem of tiny-face detection, which aims to search a tiny face in a whole image, especially in a low-resolution image.…”
Section: Introductionmentioning
confidence: 99%