Our system is currently under heavy load due to increased usage. We're actively working on upgrades to improve performance. Thank you for your patience.
2021
DOI: 10.1145/3436754
|View full text |Cite
|
Sign up to set email alerts
|

Global-Local Enhancement Network for NMF-Aware Sign Language Recognition

Abstract: Sign language recognition (SLR) is a challenging problem, involving complex manual features (i.e., hand gestures) and fine-grained non-manual features (NMFs) (i.e., facial expression, mouth shapes, etc .). Although manual features are dominant, non-manual features also play an important role in the expression of a sign word. Specifically, many sign words convey different meanings due to non-manual features, even though they share the same hand gestures. This ambiguity introduces great c… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
20
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
3
3
3

Relationship

1
8

Authors

Journals

citations
Cited by 40 publications
(32 citation statements)
references
References 67 publications
0
20
0
Order By: Relevance
“…We observe that our approach can preserve fine-grained identity attributes of the source image, such as face and hair. In the future, we will continue exploring the potential of applying this method to other relevant research fields, such as data augmentation for sign language recognition [11], [66]- [70] and clothing / makeup try-on according to keypoints [71]- [73].…”
Section: Discussionmentioning
confidence: 99%
“…We observe that our approach can preserve fine-grained identity attributes of the source image, such as face and hair. In the future, we will continue exploring the potential of applying this method to other relevant research fields, such as data augmentation for sign language recognition [11], [66]- [70] and clothing / makeup try-on according to keypoints [71]- [73].…”
Section: Discussionmentioning
confidence: 99%
“…They used the MSASL dataset to transfer the ASL knowledge to recognize GSL (German Sign Language) on the SIGNUM dataset and achieved an accuracy of 0.75 for high target data. Hu et al [140] pointed out non-manual feature-aware GLEN (Global local enhancement network) based on the SLR model. They achieved a top 1 accuracy of 69.9% for NMFs-CSL datasets and 96.8% for isolated SLR 500 datasets.…”
Section: B Study Of Current State-of-the-art Models For Sign Language...mentioning
confidence: 99%
“…cSLR aims at transcribing videobased sign language into gloss sequence. With the released of larger-scale cSLR datasets [28], numerous researches burst out implementing sign language recognition tasks in an end-to-end manner [1][2][3][4][5][6][7][8][9][10][11][12][13]. The gloss annotations are in same order with sign language, this monotonic relationship significantly ease the syntactic alignment with the cSLR methods.…”
Section: A Sign Language Recognitionmentioning
confidence: 99%
“…Sign Language Recognition (SLR) and Translation (SLT) aim at converting the video-based sign languages into sign gloss sequences and spoken language sentences, respectively. Most previous works in this field focus on continuous SLR with the gloss supervision [1][2][3][4][5][6][7][8][9][10][11][12][13], few attempts have been made for SLT [14][15][16][17]. The main difference is that gloss labels are in the same order with sign gestures, and thus the gloss annotations significantly ease the syntactic alignment under the SLR methods.…”
Section: Introductionmentioning
confidence: 99%