2022
DOI: 10.1613/jair.1.13083

Crossing the Conversational Chasm: A Primer on Natural Language Processing for Multilingual Task-Oriented Dialogue Systems

Abstract: In task-oriented dialogue (ToD), a user holds a conversation with an artificial agent with the aim of completing a concrete task. Although this technology represents one of the central objectives of AI and has been the focus of ever more intense research and development efforts, it is currently limited to a few narrow domains (e.g., food ordering, ticket booking) and a handful of languages (e.g., English, Chinese). This work provides an extensive overview of existing methods and resources in multilingual …

Cited by 20 publications (16 citation statements)
References 286 publications
“…While earlier datasets focused on a single domain (Henderson et al., 2014a,b; Wen et al., 2017), the focus shifted towards the more realistic multi-domain task-oriented dialogs with the creation of the MultiWOZ dataset, which has been refined and improved in several iterations (Zang et al., 2020; Han et al., 2021). Due to the particularly high costs of creating TOD datasets (in comparison with other language understanding tasks) (Razumovskaia et al., 2021), only a handful of monolingual TOD datasets in languages other than English (Zhu et al., 2020) or bilingual TOD datasets have been created (Gunasekara et al., 2020; Lin et al., 2021).…”
Section: Few-shot Transfer and Sample Efficiency
confidence: 99%
“…This lack can be attributed to the fact that creating TOD datasets for new languages from scratch or via translation of English datasets is significantly more expensive and time-consuming than for most other NLP tasks. However, the absence of multilingual datasets that are comparable (i.e., aligned) across languages prevents a reliable estimate of the effectiveness of cross-lingual transfer techniques in multi-domain TOD (Razumovskaia et al., 2021).…”
Section: Introduction
confidence: 99%
“…Fine-tuning a large multilingual LM has become a standard for multilingual NLU (Zhang et al., 2019; Kulshreshtha et al., 2020). However, the excessively high data annotation costs for multiple domains and languages still hinder progress in multilingual dialogue (Razumovskaia et al., 2021). In this paper, unlike prior work, we propose to use external unannotated data to mine and automatically label in-domain in-language examples which aid learning in low-data regimes across multiple languages.…”
Section: Related Work and Background
confidence: 99%
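The mining-and-auto-labelling idea in the excerpt above is, at its core, confidence-thresholded self-training: label external unannotated in-language text with the current model and keep only the predictions it is confident about. The sketch below illustrates that general pattern with a toy intent classifier; the scikit-learn pipeline, the example utterances, the label set, and the 0.6 threshold are illustrative assumptions, not the cited paper's actual models or data.

```python
# A minimal self-training sketch: mine pseudo-labelled in-language examples
# from an unannotated pool and retrain on them. Classifier, data, and the
# confidence threshold are all toy assumptions for illustration.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

# A handful of labelled in-language seed examples (the low-data regime).
seed_texts = ["book a table for two", "play some jazz", "set an alarm for 7am",
              "reserve a restaurant tonight", "put on rock music", "wake me at six"]
seed_labels = ["restaurant", "music", "alarm",
               "restaurant", "music", "alarm"]

# External unannotated in-language pool to mine from.
unlabeled_pool = ["find me a sushi place", "turn up the volume",
                  "alarm at noon please", "what is the weather like"]

model = make_pipeline(TfidfVectorizer(), LogisticRegression(max_iter=1000))
model.fit(seed_texts, seed_labels)

# Mine: keep only pool sentences the current model labels with high confidence.
CONFIDENCE_THRESHOLD = 0.6  # illustrative value
mined_texts, mined_labels = [], []
for text, dist in zip(unlabeled_pool, model.predict_proba(unlabeled_pool)):
    best = dist.argmax()
    if dist[best] >= CONFIDENCE_THRESHOLD:
        mined_texts.append(text)
        mined_labels.append(str(model.classes_[best]))

# Retrain on the seed data augmented with the automatically labelled examples.
model.fit(seed_texts + mined_texts, seed_labels + mined_labels)
print(f"mined {len(mined_texts)} pseudo-labelled examples")
```

In practice the seed model would more likely be a fine-tuned multilingual encoder such as mBERT or XLM-R rather than a bag-of-words classifier, but the selection-and-retraining loop stays the same.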
“…At the same time, porting an NLU system to any new domain and language requires collecting a large in-domain dataset and training a model for the target language. Such in-domain annotations in multiple languages are extremely expensive and time-consuming (Rastogi et al., 2020), which is also reflected in the fact that large enough dialogue NLU datasets for other languages are still few and far between (Razumovskaia et al., 2021). This in turn creates the demand for strong multilingual and cross-lingual methods which generalise well and learn effectively in zero-shot and few-shot scenarios.…”
Section: Introduction
confidence: 99%
“…This allows mBERT to share embeddings across languages, which achieves promising performance on various cross-lingual NLP tasks; 2) Ensemble-Net. Razumovskaia et al. (2021) propose an Ensemble-Net where predictions are determined by 8 independent models through majority voting, each separately trained on a single source language, which achieves promising performance on zero-shot cross-lingual SLU; 3) AR-S2S-PTR. Rongali et al. (2020) proposed a unified sequence-to-sequence model with a pointer-generator network for cross-lingual SLU; 4) IT-S2S-PTR.…”
Section: Baselines
confidence: 99%
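For readers unfamiliar with the Ensemble-Net setup summarised above (several independent models, each trained on a single source language, combined by majority voting), here is a minimal sketch of the voting step. The dummy per-language models, the intent labels, and the tie-breaking behaviour are assumptions made purely for illustration.

```python
# A minimal majority-voting sketch over per-source-language models.
# Each "model" here is a stand-in callable; real models would be trained
# separately, one per source language, as in the excerpt above.
from collections import Counter
from typing import Callable, Sequence


def ensemble_predict(models: Sequence[Callable[[str], str]], utterance: str) -> str:
    """Return the label predicted by the largest number of models."""
    votes = [model(utterance) for model in models]
    # Counter.most_common breaks ties by first occurrence; a real system would
    # need an explicit tie-breaking rule (e.g., highest summed confidence).
    return Counter(votes).most_common(1)[0][0]


def make_dummy_model(label: str) -> Callable[[str], str]:
    """Toy stand-in for a model trained on one source language."""
    return lambda utterance: label


# Eight dummy "per-language" models casting votes on an utterance.
models = [make_dummy_model(lbl) for lbl in
          ["book_flight", "book_flight", "book_flight", "play_music",
           "book_flight", "play_music", "book_flight", "set_alarm"]]

print(ensemble_predict(models, "I need a ticket to Berlin"))  # -> book_flight
```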