Proceedings of the 28th International Conference on Computational Linguistics 2020
DOI: 10.18653/v1/2020.coling-main.416

Emergent Communication Pretraining for Few-Shot Machine Translation

Abstract: While state-of-the-art models that rely upon massively multilingual pretrained encoders achieve sample efficiency in downstream applications, they still require abundant amounts of unlabelled text. Nevertheless, most of the world's languages lack such resources. Hence, we investigate a more radical form of unsupervised knowledge transfer in the absence of linguistic data. In particular, for the first time we pretrain neural networks via emergent communication from referential games. Our key assumption is that …

Cited by 7 publications (19 citation statements). References 41 publications.

“…We also establish the non-triviality of such a transfer performance by comparing to other synthetic and natural source corpora, as well as multiple ablation studies on the EC and downstream transfer setups to help understand the transferability of emergent language. Notably, our method of corpus transfer significantly outperforms directly transferring the trained emergent speaker model (Li et al., 2020b), demonstrating that modeling the emergent language could yield greater usefulness than directly transferring the EC agents.…”
Section: Speaker (mentioning)
Confidence: 97%

“…A typical setup is the image referential game (Figure 1(a)), where a speaker generates a discrete sequence of tokens based on an input image, a listener is challenged to select the input out of distractors based on the message, and both networks are optimized jointly via game-success signals. By studying these games, researchers are interested in the emergence of desirable properties resembling natural language, such as game-success generalization (Lazaridou & Baroni, 2020) and compositionality (Smith et al., 2003; Kirby et al., 2015; Lazaridou et al., 2018; Li et al., 2020b). However, these properties are mostly defined and analyzed within each individual game framework.…”
Section: Speaker (mentioning)
Confidence: 99%
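
To make the game setup quoted above concrete, here is a minimal sketch of an image referential game in PyTorch, assuming pre-extracted image feature vectors and a straight-through Gumbel-softmax relaxation of the discrete channel. All module names, shapes, and hyperparameters are illustrative assumptions, not the implementation from the paper or the citing work.

    # Minimal referential-game sketch: speaker emits discrete tokens from an
    # image; listener picks the target image out of in-batch distractors.
    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    VOCAB, MSG_LEN, FEAT, HID = 32, 5, 512, 256  # illustrative sizes

    class Speaker(nn.Module):
        def __init__(self):
            super().__init__()
            self.img2h = nn.Linear(FEAT, HID)
            self.rnn = nn.GRUCell(VOCAB, HID)
            self.out = nn.Linear(HID, VOCAB)

        def forward(self, img, tau=1.0):
            h = torch.tanh(self.img2h(img))      # condition on the target image
            tok = torch.zeros(img.size(0), VOCAB, device=img.device)  # BOS
            msg = []
            for _ in range(MSG_LEN):
                h = self.rnn(tok, h)
                # Straight-through Gumbel-softmax: discrete one-hot tokens in
                # the forward pass, differentiable surrogate in the backward.
                tok = F.gumbel_softmax(self.out(h), tau=tau, hard=True)
                msg.append(tok)
            return torch.stack(msg, dim=1)       # (batch, MSG_LEN, VOCAB)

    class Listener(nn.Module):
        def __init__(self):
            super().__init__()
            self.rnn = nn.GRU(VOCAB, HID, batch_first=True)
            self.img2h = nn.Linear(FEAT, HID)

        def forward(self, msg, candidates):
            _, h = self.rnn(msg)                 # encode the message
            c = self.img2h(candidates)           # (batch, n_cand, HID)
            # Dot-product score between message encoding and each candidate.
            return torch.einsum('bh,bnh->bn', h.squeeze(0), c)

    speaker, listener = Speaker(), Listener()
    opt = torch.optim.Adam(
        list(speaker.parameters()) + list(listener.parameters()), lr=1e-4)

    def play_round(images):
        """One batch: each image is a target; the others act as distractors."""
        batch = images.size(0)
        msg = speaker(images)
        scores = listener(msg, images.unsqueeze(0).expand(batch, -1, -1))
        target = torch.arange(batch)             # i-th message names i-th image
        loss = F.cross_entropy(scores, target)   # game-success signal
        opt.zero_grad()
        loss.backward()
        opt.step()
        return loss.item()

    loss = play_round(torch.randn(16, FEAT))     # toy batch of image features

The cross-entropy over the listener's candidate scores plays the role of the game-success signal mentioned in the excerpt: it is the only supervision, and both agents are optimized jointly through it.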