Jannis Vamvas scite author profile

Jannis Vamvas

5 Publications

14 Citation Statements Received

211 Citation Statements Given


Affiliations

University of Zurich

Publications


Contrastive Conditioning for Assessing Disambiguation in MT: A Case Study of Distilled Bias

Vamvas¹, Sennrich²

2021


Lexical disambiguation is a major challenge for machine translation systems, especially if some senses of a word are trained less often than others. Identifying patterns of overgeneralization requires evaluation methods that are both reliable and scalable. We propose contrastive conditioning as a reference-free blackbox method for detecting disambiguation errors. Specifically, we score the quality of a translation by conditioning on variants of the source that provide contrastive disambiguation cues. After validating our method, we apply it in a case study to perform a targeted evaluation of sequence-level knowledge distillation. By probing word sense disambiguation and translation of gendered occupation names, we show that distillation-trained models tend to overgeneralize more than other models with a comparable BLEU score. Contrastive conditioning thus highlights a side effect of distillation that is not fully captured by standard evaluation metrics. Code and data to reproduce our findings are publicly available.


On the Limits of Minimal Pairs in Contrastive Evaluation

Vamvas¹, Sennrich²

2021


Minimal sentence pairs are frequently used to analyze the behavior of language models. It is often assumed that model behavior on contrastive pairs is predictive of model behavior at large. We argue that two conditions are necessary for this assumption to hold: First, a tested hypothesis should be well-motivated, since experiments show that contrastive evaluation can lead to false positives. Second, test data should be chosen so as to minimize distributional discrepancy between evaluation time and deployment time. For a good approximation of deployment-time decoding, we recommend that minimal pairs be created based on machine-generated text, as opposed to human-written references. We present a contrastive evaluation suite for English-German MT that implements this recommendation.


As Little as Possible, as Much as Necessary: Detecting Over- and Undertranslations with Contrastive Conditioning

Vamvas¹, Sennrich²

2022

Preprint


As Little as Possible, as Much as Necessary: Detecting Over- and Undertranslations with Contrastive Conditioning

Vamvas¹, Sennrich²

2022


Data and Code for X-Stance

Vamvas¹, Sennrich²

2020

