2022
DOI: 10.3390/info13050220

Leveraging Frozen Pretrained Written Language Models for Neural Sign Language Translation

Abstract: We consider neural sign language translation: machine translation from signed to written languages using encoder–decoder neural networks. Translating sign language videos to written language text is especially complex because of the difference in modality between source and target language and, consequently, the required video processing. At the same time, sign languages are low-resource languages, their datasets dwarfed by those available for written languages. Recent advances in written language processing a…
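The approach named in the title — reusing a frozen pretrained written-language model as the decoder while training only the video-side encoder — can be sketched in PyTorch. This is a minimal illustration of the general freezing mechanic, not the paper's actual architecture: the decoder here is a randomly initialized stand-in for a pretrained language model, and all dimensions and layer counts are arbitrary assumptions.

```python
import torch
import torch.nn as nn

class SignTranslationModel(nn.Module):
    """Sketch: trainable video encoder feeding a frozen text decoder.

    The decoder stands in for a pretrained written-language model;
    in practice one would load actual pretrained weights before freezing.
    """

    def __init__(self, feat_dim=512, d_model=256, vocab_size=1000):
        super().__init__()
        # Trainable: projects per-frame video features into the decoder's space
        # and models temporal context over the frame sequence.
        self.encoder = nn.Sequential(
            nn.Linear(feat_dim, d_model),
            nn.ReLU(),
            nn.TransformerEncoder(
                nn.TransformerEncoderLayer(d_model, nhead=4, batch_first=True),
                num_layers=2,
            ),
        )
        self.embed = nn.Embedding(vocab_size, d_model)
        # "Pretrained" decoder: frozen, so only the encoder adapts to video.
        self.decoder = nn.TransformerDecoder(
            nn.TransformerDecoderLayer(d_model, nhead=4, batch_first=True),
            num_layers=2,
        )
        self.out = nn.Linear(d_model, vocab_size)
        for p in self.decoder.parameters():
            p.requires_grad = False  # keep the language-model decoder frozen

    def forward(self, video_feats, tgt_tokens):
        memory = self.encoder(video_feats)           # (B, T, d_model)
        tgt = self.embed(tgt_tokens)                 # (B, L, d_model)
        return self.out(self.decoder(tgt, memory))   # (B, L, vocab_size)

model = SignTranslationModel()
logits = model(torch.randn(2, 16, 512), torch.randint(0, 1000, (2, 8)))
print(logits.shape)  # torch.Size([2, 8, 1000])
```

Because the decoder parameters have `requires_grad = False`, an optimizer built over `model.parameters()` updates only the encoder, embedding, and output projection — the low-resource sign-language data never has to relearn written-language generation.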


Cited by 18 publications (23 citation statements). References 31 publications (55 reference statements).
“…20 papers use a 2D CNN as feature extractor [6, 35-38, 64, 65, 72, 75, 77-81, 87, 88, 92, 93, 95, 98]. These are often pre-trained for image classification using the ImageNet dataset [274]; some are further pre-trained on the task of Continuous Sign Language Recognition (CSLR), e.g., [37,38,92,93]. Three papers use a subsequent 1D CNN to temporally process the resulting spatial features [64,77,80].…”
Section: Extraction Methods
confidence: 99%
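The extraction pipeline the statement describes — a 2D CNN producing per-frame spatial features, followed by a 1D CNN that processes those features over time — can be sketched as follows. This is a hedged illustration, not any cited paper's model: the tiny 2D CNN stands in for an ImageNet-pretrained backbone, and `feat_dim` and kernel sizes are assumptions.

```python
import torch
import torch.nn as nn

class SpatioTemporalExtractor(nn.Module):
    """2D CNN per frame (spatial), then 1D CNN across frames (temporal)."""

    def __init__(self, feat_dim=64):
        super().__init__()
        # Stand-in for a pretrained image backbone: one frame in,
        # one spatial feature vector out.
        self.cnn2d = nn.Sequential(
            nn.Conv2d(3, 16, kernel_size=3, stride=2, padding=1),
            nn.ReLU(),
            nn.AdaptiveAvgPool2d(1),
            nn.Flatten(),
            nn.Linear(16, feat_dim),
        )
        # Temporal convolution over the sequence of spatial features.
        self.cnn1d = nn.Conv1d(feat_dim, feat_dim, kernel_size=5, padding=2)

    def forward(self, video):                        # (B, T, 3, H, W)
        b, t = video.shape[:2]
        frames = video.flatten(0, 1)                 # (B*T, 3, H, W)
        feats = self.cnn2d(frames).view(b, t, -1)    # (B, T, feat_dim)
        # Conv1d expects (B, C, T), so transpose in and back out.
        return self.cnn1d(feats.transpose(1, 2)).transpose(1, 2)

x = torch.randn(2, 10, 3, 32, 32)  # 2 clips of 10 RGB frames
print(SpatioTemporalExtractor()(x).shape)  # torch.Size([2, 10, 64])
```

Flattening batch and time lets the 2D CNN treat every frame as an independent image — which is exactly why such backbones can be pretrained on ImageNet — while the 1D convolution afterward is the only part that sees temporal structure.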
“…For instance, Koller et al presented SLR systems that exploit SignWriting [33,34], and these systems are leveraged in some later works on SLT, e.g., [35,36]. Many SLT models also use feature extractors that were pre-trained with gloss labels, e.g., [37,38].…”
Section: Notation Systems For Sign Languages
confidence: 99%