2017 IEEE International Conference on Computer Vision Workshops (ICCVW)
DOI: 10.1109/iccvw.2017.365
Gesture and Sign Language Recognition with Temporal Residual Networks

Abstract: Gesture and sign language recognition in a continuous video stream is a challenging task, especially with a large vocabulary. In this work, we approach this as a framewise classification problem. We tackle it using temporal convolutions and recent advances in the deep learning field like residual networks, batch normalization and exponential linear units (ELUs). The models are evaluated on three different […]
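The abstract's core idea — framewise classification of a video stream with temporal convolutions, residual connections, and ELUs — can be sketched in a few lines. This is a minimal NumPy illustration under assumed shapes, not the authors' architecture: batch normalization is omitted, and the layer widths, kernel size, and linear output head are invented for the example.

```python
import numpy as np

def elu(x, alpha=1.0):
    """Exponential linear unit: identity for x > 0, alpha*(exp(x)-1) otherwise."""
    return np.where(x > 0, x, alpha * np.expm1(x))

def temporal_conv(x, w, b):
    """1-D convolution over the time axis with 'same' zero padding.

    x: (T, C_in) framewise features; w: (K, C_in, C_out); b: (C_out,).
    """
    k = w.shape[0]
    pad = k // 2
    xp = np.pad(x, ((pad, pad), (0, 0)))
    return np.stack(
        [np.tensordot(xp[t:t + k], w, axes=([0, 1], [0, 1])) for t in range(x.shape[0])]
    ) + b

def temporal_residual_block(x, w1, b1, w2, b2):
    """Pre-activation residual block over time: x + conv(elu(conv(elu(x))))."""
    h = temporal_conv(elu(x), w1, b1)
    h = temporal_conv(elu(h), w2, b2)
    return x + h

# Toy framewise classification over a short clip (all sizes are illustrative).
rng = np.random.default_rng(0)
T, C, K, n_classes = 16, 8, 3, 5              # frames, channels, kernel width, classes
x = rng.standard_normal((T, C))               # one feature vector per video frame
w1, w2 = (rng.standard_normal((K, C, C)) * 0.1 for _ in range(2))
b1 = b2 = np.zeros(C)
feats = temporal_residual_block(x, w1, b1, w2, b2)   # (T, C): same length as the input
w_out = rng.standard_normal((C, n_classes)) * 0.1
logits = feats @ w_out                               # (T, n_classes): a label per frame
print(logits.shape)
```

Because the temporal convolutions use 'same' padding, the block preserves the number of frames, which is what makes per-frame labeling of a continuous stream possible without segmenting the video first.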

Cited by 50 publications
(32 citation statements)
References 17 publications
“…CNNs have also been used in attempts to resolve the challenging task of gesture and sign language recognition in a continuous video stream. For instance, Pigou et al [126] used a deep learning approach and temporal convolutions to address this problem. The CNN model featured certain improvements that made it easier to conduct the classification process.…”
Section: Deep Learning Techniques
confidence: 99%
“…However, this approach proved to be extremely challenging because the animations were difficult to work with after processing. While exploring the challenges of continuous translation, Pigou et al [126] observed that deep residual networks can be used to learn patterns in continuous videos containing gestures and signs. The use of deep residual networks can minimize the need for preprocessing.…”
Section: A. SLR Continuous Models
confidence: 99%
“…Although there has been much research related to technologies for the deaf or hard of hearing (HoH) over the past three decades, much of this work has focused on the translation of sign language into voice or text using camera-based or wearable devices. Although sensor-augmented gloves [1]–[3] have been reported to typically yield higher gesture recognition rates than camera-based systems [4]–[6], they cannot capture the intricacies of sign languages presented through head and body movements. In contrast, video can capture facial expressions, but requires adequate light and a direct line-of-sight to be effective.…”
Section: Introduction
confidence: 99%
“…Because it is much easier to recruit hearing participants than deaf participants, many studies on ASL recognition (e.g. [6], [11]–[13]) have used imitation signing data, despite its differences from native ASL data.…”
Section: Introduction
confidence: 99%