Dana Ruiter scite author profile

Dana Ruiter

20Publications

58Citation Statements Received

360Citation Statements Given

How they've been cited

How they cite others

315

359

Affiliations

Saarland University

Publications

Order By: Most citations

Self-Supervised Neural Machine Translation

Ruiter¹,

España-Bonet²,

Genabith³

2019

View full text Add to dashboard Cite

We present a simple new method where an emergent NMT system is used for simultaneously selecting training data and learning internal NMT representations. This is done in a self-supervised way without parallel data, in such a way that both tasks enhance each other during training. The method is language independent, introduces no additional hyper-parameters, and achieves BLEU scores of 29.21 (en2f r) and 27.36 (f r2en) on new-stest2014 using English and French Wikipedia data for training.

show abstract

Self-Induced Curriculum Learning in Self-Supervised Neural Machine Translation

Ruiter

Genabith

España-Bonet

2020

View full text Add to dashboard Cite

Self-supervised neural machine translation (SSNMT) jointly learns to identify and select suitable training data from comparable (rather than parallel) corpora and to translate, in a way that the two tasks support each other in a virtuous circle. In this study, we provide an in-depth analysis of the sampling choices the SSNMT model makes during training. We show how, without it having been told to do so, the model self-selects samples of increasing (i) complexity and (ii) task-relevance in combination with (iii) performing a denoising curriculum. We observe that the dynamics of the mutual-supervision signals of both system internal representation types are vital for the extraction and translation performance. We show that in terms of the Gunning-Fog Readability index, SSNMT starts extracting and learning from Wikipedia data suitable for high school students and quickly moves towards content suitable for first year undergraduate students.

show abstract

HUMAN: Hierarchical Universal Modular ANnotator

Wolf¹,

Ruiter

D'Sa

et al. 2020

View full text Add to dashboard Cite

A lot of real-world phenomena are complex and cannot be captured by single task annotations. This causes a need for subsequent annotations, with interdependent questions and answers describing the nature of the subject at hand. Even in the case a phenomenon is easily captured by a single task, the high specialisation of most annotation tools can result in having to switch to another tool if the task only slightly changes. We introduce HU-MAN, a novel web-based annotation tool that addresses the above problems by a) covering a variety of annotation tasks on both textual and image data, and b) the usage of an internal deterministic state machine, allowing the researcher to chain different annotation tasks in an interdependent manner. Further, the modular nature of the tool makes it easy to define new annotation tasks and integrate machine learning algorithms e.g., for active learning. HUMAN comes with an easy-to-use graphical user interface that simplifies the annotation task and management.

show abstract

Emoji-Based Transfer Learning for Sentiment Tasks

Boy¹,

Ruiter²,

Klakow³

2021

View full text Add to dashboard Cite

Sentiment tasks such as hate speech detection and sentiment analysis, especially when performed on languages other than English, are often low-resource. In this study, we exploit the emotional information encoded in emojis to enhance the performance on a variety of sentiment tasks. This is done using a transfer learning approach, where the parameters learned by an emoji-based source task are transferred to a sentiment target task. We analyse the efficacy of the transfer under three conditions, i.e. i) the emoji content and ii) label distribution of the target task as well as iii) the difference between monolingually and multilingually learned source tasks. We find i.a. that the transfer is most beneficial if the target task is balanced with high emoji content. Monolingually learned source tasks have the benefit of taking into account the culturally specific use of emojis and gain up to F1 +0.280 over the baseline.

show abstract

StereoKG: Data-Driven Knowledge Graph Construction For Cultural Knowledge and Stereotypes

Deshpande¹,

Ruiter²,

Mosbach³

et al. 2022

View full text Add to dashboard Cite

Analyzing ethnic or religious bias is important for improving fairness, accountability, and transparency of natural language processing models. However, many techniques rely on human-compiled lists of bias terms, which are expensive to create and are limited in coverage. In this study, we present a fully datadriven pipeline for generating a knowledge graph (KG) of cultural knowledge and stereotypes. Our resulting KG covers 5 religious groups and 5 nationalities and can easily be extended to include more entities. Our human evaluation shows that the majority (59.2%) of non-singleton entries are coherent and complete stereotypes. We further show that performing intermediate masked language model training on the verbalized KG leads to a higher level of cultural awareness in the model and has the potential to increase classification performance on knowledge-crucial samples on a related task, i.e., hate speech detection.

show abstract

A Few Thousand Translations Go a Long Way! Leveraging Pre-trained Models for African News Translation

Adelani¹,

Alabi²,

Fan³

et al. 2022

Preprint

View full text Add to dashboard Cite

Recent advances in the pre-training of language models leverage large-scale datasets to create multilingual models. However, lowresource languages are mostly left out in these datasets. This is primarily because many widely spoken languages are not well represented on the web and therefore excluded from the large-scale crawls used to create datasets. Furthermore, downstream users of these models are restricted to the selection of languages originally chosen for pre-training. This work investigates how to optimally leverage existing pre-trained models to create low-resource translation systems for 16 African languages. We focus on two questions: 1) How can pretrained models be used for languages not included in the initial pre-training? and 2) How can the resulting translation models effectively transfer to new domains? To answer these questions, we create a new African news corpus covering 16 languages, of which eight languages are not part of any existing evaluation dataset. We demonstrate that the most effective strategy for transferring both to additional languages and to additional domains is to finetune large pre-trained models on small quantities of high-quality translation data.

show abstract

A Few Thousand Translations Go a Long Way! Leveraging Pre-trained Models for African News Translation

Adelani¹,

Alabi²,

Fan³

et al. 2022

View full text Add to dashboard Cite

Recent advances in the pre-training of language models leverage large-scale datasets to create multilingual models. However, low-resource languages are mostly left out in these datasets. This is primarily because many widely spoken languages are not well represented on the web and therefore excluded from the large-scale crawls used to create datasets. Furthermore, downstream users of these models are restricted to the selection of languages originally chosen for pre-training. This work investigates how to optimally leverage existing pre-trained models to create low-resource translation systems for 16 African languages. We focus on two questions: 1) How can pre-trained models be used for languages not included in the initial pre-training? and 2) How can the resulting translation models effectively transfer to new domains? To answer these questions, we create a new African news corpus covering 16 languages, of which eight languages are not part of any existing evaluation dataset. We demonstrate that the most effective strategy for transferring both to additional languages and to additional domains is to fine-tune large pre-trained models on small quantities of highquality translation data.

show abstract

Exploiting Social Media Content for Self-Supervised Style Transfer

Ruiter¹,

Kleinbauer²,

España-Bonet³

et al. 2022

Preprint

View full text Add to dashboard Cite

12 3

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Dana Ruiter

Self-Supervised Neural Machine Translation

Self-Induced Curriculum Learning in Self-Supervised Neural Machine Translation

HUMAN: Hierarchical Universal Modular ANnotator

Emoji-Based Transfer Learning for Sentiment Tasks

StereoKG: Data-Driven Knowledge Graph Construction For Cultural Knowledge and Stereotypes

A Few Thousand Translations Go a Long Way! Leveraging Pre-trained Models for African News Translation

A Few Thousand Translations Go a Long Way! Leveraging Pre-trained Models for African News Translation

Exploiting Social Media Content for Self-Supervised Style Transfer

Contact Info

Product

Resources

About