From Masked Language Modeling to Translation: Non-English Auxiliary Tasks Improve Zero-shot Spoken Language Understanding

Goot, Rob van der; Sharaf, Ibrahim; Imankulova, Aizhan; Üstün, Ahmet; Stepanović, Marija; Ramponi, Alan; Khairunnisa, Siti Oryza; Komachi, Mamoru; Plank, Barbara

doi:10.18653/v1/2021.naacl-main.197

Cited by 26 publications

(39 citation statements)

References 32 publications

(43 reference statements)


“…Results Our main results (Figure 4) show the baselines against ISO, AOC, and WSE of both datasets. We evaluate with two types of F1, following van der Goot et al [20]: strict and loose-F1. For full model fine-tuning, RoBERTa achieves 91.31 and 98.55 strict and loose F1 on Sayfullina respectively.…”

Section: Analysis Of Results (mentioning)

confidence: 99%


Skill Extraction from Job Postings using Weak Supervision

Zhang, Jensen, Goot et al. 2022

Preprint

Self Cite


Aggregated data obtained from job postings provide powerful insights into labor market demands, and emerging skills, and aid job matching. However, most extraction approaches are supervised and thus need costly and time-consuming annotation. To overcome this, we propose Skill Extraction with Weak Supervision. We leverage the European Skills, Competences, Qualifications and Occupations taxonomy to find similar skills in job ads via latent representations. The method shows a strong positive signal, outperforming baselines based on token-level and syntactic patterns.


“…Definition F1 As mentioned, we evaluate with two types of F1-scores, following van der Goot et al [20]. The first type is the commonly used span-F1, where only the correct span and label are counted towards true positives.…”

Section: Table (mentioning)

confidence: 99%

Skill Extraction from Job Postings using Weak Supervision

Zhang, Jensen, Goot et al. 2022

Preprint

Self Cite
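The span-F1 described in the citation statement above can be illustrated with a minimal sketch. It assumes BIO-tagged token sequences and a hypothetical `SKILL` label, neither of which is specified in the excerpt, and it is not the citing authors' evaluation code. Only the first type (exact span and label match) is shown, because the excerpt is cut off before the second type is defined.

```python
from typing import List, Set, Tuple

# (start index, end index exclusive, label); this layout is an assumption.
Span = Tuple[int, int, str]


def extract_spans(tags: List[str]) -> Set[Span]:
    """Collect labeled spans from one BIO tag sequence."""
    spans: Set[Span] = set()
    start, label = None, None
    for i, tag in enumerate(tags + ["O"]):  # sentinel closes a trailing span
        # A span continues only on an I- tag that carries the same label.
        continues = tag.startswith("I-") and label == tag[2:]
        if start is not None and not continues:
            spans.add((start, i, label))
            start, label = None, None
        if tag.startswith("B-"):
            start, label = i, tag[2:]
        # An I- tag with no open span is ignored in this sketch.
    return spans


def span_f1(gold: List[List[str]], pred: List[List[str]]) -> float:
    """Span-F1: a prediction is a true positive only if both its
    boundaries and its label match a gold span exactly."""
    tp = n_gold = n_pred = 0
    for g_tags, p_tags in zip(gold, pred):
        g, p = extract_spans(g_tags), extract_spans(p_tags)
        tp += len(g & p)
        n_gold += len(g)
        n_pred += len(p)
    if tp == 0:
        return 0.0
    precision, recall = tp / n_pred, tp / n_gold
    return 2 * precision * recall / (precision + recall)


# Hypothetical example: the first predicted span is too short, so it does
# not count; the second matches exactly.
gold = [["B-SKILL", "I-SKILL", "O", "B-SKILL"]]
pred = [["B-SKILL", "O", "O", "B-SKILL"]]
score = span_f1(gold, pred)
```

In this example one of two predicted spans and one of two gold spans match, so precision and recall are both 0.5.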


“…Most existing datasets, however, either cover multiple domains in a single language (Hakkani-Tür et al, 2016; or the same domain across different languages (Xu et al, 2020). Fortunately, the most recent generation of NLU datasets (Li et al, 2021; van der Goot et al, 2021; Majewska et al, 2022) is both multi-lingual and multi-domain, thus opening up the possibility to assess the true generality of current cross-lingual transfer approaches. Table 3: Multilingual DST datasets.…”

Section: Natural Language Understanding (NLU) (mentioning)

confidence: 99%

“…Secondly, since direct translation still dominates multilingual ToD data collection, there have been several approaches to lower human effort in the translation procedure. In most cases translators would simultaneously annotate the datasets with slot labels and/or dialogue states, depending on the tasks the dataset covers Xu et al, 2020; van der Goot et al, 2021). One approach simplifies the translation process itself, which typically proceeds in two stages: (i) machine translation into the target language; (ii) manual post-editing by native speakers of the language (Zuo et al, 2021; Hung et al, 2022).…”

Section: Outlook For Multilingual ToD Datasets (mentioning)

confidence: 99%

Crossing the Conversational Chasm: A Primer on Natural Language Processing for Multilingual Task-Oriented Dialogue Systems

Razumovskaia, Glavaš, Majewska et al. 2022

JAIR


In task-oriented dialogue (ToD), a user holds a conversation with an artificial agent with the aim of completing a concrete task. Although this technology represents one of the central objectives of AI and has been the focus of ever more intense research and development efforts, it is currently limited to a few narrow domains (e.g., food ordering, ticket booking) and a handful of languages (e.g., English, Chinese). This work provides an extensive overview of existing methods and resources in multilingual ToD as an entry point to this exciting and emerging field. We find that the most critical factor preventing the creation of truly multilingual ToD systems is the lack of datasets in most languages for both training and evaluation. In fact, acquiring annotations or human feedback for each component of modular systems or for data-hungry end-to-end systems is expensive and tedious. Hence, state-of-the-art approaches to multilingual ToD mostly rely on (zero- or few-shot) cross-lingual transfer from resource-rich languages (almost exclusively English), either by means of (i) machine translation or (ii) multilingual representations. These approaches are currently viable only for typologically similar languages and languages with parallel / monolingual corpora available. On the other hand, their effectiveness beyond these boundaries is doubtful or hard to assess due to the lack of linguistically diverse benchmarks (especially for natural language generation and end-to-end evaluation). To overcome this limitation, we draw parallels between components of the ToD pipeline and other NLP tasks, which can inspire solutions for learning in low-resource scenarios. Finally, we list additional challenges that multilinguality poses for related areas (such as speech, fluency in generated text, and human-centred evaluation), and indicate future directions that hold promise to further expand language coverage and dialogue capabilities of current ToD systems.


“…Preliminary findings showed that, among the Other cases, about 56 of the completions provided by BERT are unacceptable and 34 of them are dubious acceptable i.e. not clearly recognizable as acceptable, as in the case of the following sentence: Secondo gli esperti, in Italia i giovani leggono meno i giornali rispetto ai giovani di altri Paesi europei, ... rispetto agli anni passati i giovani tra i 14 e i 19 anni leggono più spesso i giornali. [perché anche però].…”

Section: Testing the Sensitivity Of Neural Language Models To Connect... (unclassified)

Language Transfer for Identifying Diagnostic Paragraphs in Clinical Notes

Liello, Uryupina, Moschitti 2022

Proceedings of the Eighth Italian Conference on Computational Linguistics CliC-it 2021


The eighth edition of the Italian Conference on Computational Linguistics (CLiC-it 2021) was held at Università degli Studi di Milano-Bicocca from 26th to 28th January 2022. After the edition of 2020, which was held in fully virtual mode due to the health emergency related to Covid-19, CLiC-it 2021 represented the first moment for the Italian research community of Computational Linguistics to meet in person after more than one year of full/partial lockdown. Although the conference was held in dual mode, we strongly suggested the participants to attend it coming to Milan. Indeed, we received a strong feedback on this aspect from the community, which was eager to meet in person and enjoy both the scientific and social events together with the colleagues. In total, 99 participants registered to the conference benefiting from the early registration fee, 91 out of which expressed their intention to attend the event in person, which we consider as a very positive indication of enthusiasm from the community, given the uncertain situation due to the evolution of the pandemic in Italy. In total, we received 68 proposals, organized in the following specific tracks: Information Extraction,
